WO 2004/041196 



PCT7US2003/034787 



Title of the Invention: 

Methods and Compositions for the Diagnosis 
of Neuroendocrine Lung Cancer 

Field of the Invention: 

5 This invention relates to methods and compositions for the diagnosis of 

neuroendocrine lung cancers. In particular, the invention concerns the use of 
cDNA microarrays to facilitate the differential diagnosis of neuroendocrine tumor 
types. 

Statement of Governmental Interest 

10 This invention was funded by NCI Intramural Research Program CCR at 

the National Institutes of Health. The United States Government has certain rights 
to this invention. 

Background of the Invention: 

The mammalian neuroendocrine system is a dispersed organ system that 
15 consists of cells found in multiple different organs. The cells of the 

neuroendocrine system function in certain ways like nerve cells and in other ways 
like cells of the endocrine (hormone-producing) glands. The neuroendocrine cells 
of the lung are of particular significance; they help control airflow and blood flow 
in the lungs and may help control growth of other types of lung cells. 

20 In some instances, neuroendocrine cells escape from normal cellular 

control and become malignant, resulting in neuroendocrine tumors. Four clinically 
distinct types of neuroendocrine tumors have been described: small cell lung 
cancer (SCLC), large cell neuroendocrine carcinoma (LCNEC), typical carcinoid 
(TC) tumors and atypical carcinoid (AC) tumors. SCLC is the most serious type of 

25 neuroendocrine lung tumor (LCNEC), and is among the most rapidly growing and 
spreading of all cancers. Large cell neuroendocrine carcinoma, typical carcinoid 
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and atypical carcinoid tumors are rare forms of cancers. Whereas SCLC accounts 
for 15-25% of total pulmonary malignancies, large cell neuroendocrine carcinoma, 
typical carcinoid and atypical carcinoid tumors collectively account for only 3-5% 
of total pulmonary malignancies. Accurate diagnosis of neuroendocrine carcinoma 
5 is important since the different tumor types are associated with markedly different 
survival rates. Small Cell Lung Cancers are associated with 5 and 10 year survival 
rates of only 9% and 5%, respectively. Large Cell Neuroendocrine Carcinoma 
presently exhibit 27% and 9%, 5 and 10 year survival rates. Atypical Carcinoid 
Tumors are associated with 5 and 10 year survival rates of 56% and 35%, 
10 respectively. In contrast, Typical Carcinoid Tumors are found to have 5 and 10 
year survival rates of nearly 90% 

Neuroendocrine tumors are reviewed by Gould, V.E. et al (2000) 
"Epithelial Tumors Of The Lung" Chest Surg Clin N Am 70:709-28, by 
DeLellis, R.A. (1997) "Proliferation Markers In Neuroendocrine tumors: 

1 5 Useful Or Useless? A Critical Reappraisal" Verb Dtsch Ges Pathol 57:53- 
61, by Travis, W.D. et al (1991) "Neuroendocrine tumors Of The Lung With 
Proposed Criteria For Large-Cell Neuroendocrine Carcinoma. An 
Ultrastructural, Immunohistochemical, And Flow Cytometric Study Of 
35 CASES" Am J Surg Pathol 75:529-53, by Cerilli, L.A. etal (2001) 

20 "Neuroendocrine Neoplasms Of The Lung" Am J Clin Pathol 1 1 6:S65-96; by 
Arrigoni, M.G. et al (1972) "Atypical Carcinoid Tumors Of The Lung," J 
Thorac Cardiovasc Surg 64:413-421; by Warren, W.H. et al (1988) "Well 
Differentiated And Small Cell Neuroendocrine Carcinomas Of The 
Lung: Two Related But Distinct Clinicopathologic Entities," Virchows 

25 Arch B cell Pathol 55:299-310; by Kramer, R. (1930) "Adenoma Of Bronchus," 
Ann Oto] Rhinol Laryngol 39:689, and by Mark, E.J. etal (1985) "PERIPHERAL 
Small Cell Carcinoma Of The Lung Resembling Carcinoid Tumor," Arch 
Pathol Lab Med 109:263-269. 

Unfortunately, all neuroendocrine tumors have similar morphologic 
30 appearances and exhibit similar immunohistochemical staining. Thus, a significant 
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difficulty presently exists in accurately distinguishing between the different types 
of neuroendocrine tumors. Such diagnosis is still "decisively" based on light- 
microscopic evaluations of tissue samples for the number of cells involved in 
mitosis. Other than clinical stage at presentation, mitotic count is currently the sole 
5 independent histologic predictor of prognosis (Junker, K. et al (2000) 

"Pathology Of Small-Cell Lung Cancer," J Cancer Res Clin Oncol 725:361- 
8; Franklin, WA. (2000) "PATHOLOGY Of Lung Cancer" J Thorac Imaging, 75:3- 
12; Chyczewski, L. etal (2001) "MORPHOLOGICAL ASPECTS Of CARCINOGENESIS 
In The Lung" Lang Cancer, 34:S 17-25; Travis, W.D. etal. (1991) 
1 0 "NEUROENDOCRINE TUMORS OF THE LUNG WITH PROPOSED CRITERIA FOR LARGE- 

Cell Neuroendocrine Carcinoma. An Ultrastructural, 
immunohistochemical, and flow cytometric study of 35 cases" am j 
Surg Pathol 75:529-53; Brambilla, E. et al (2001) "The New World Health 
Organization Classification Of Lung Tumours" Eur Respir X 18: 1 059-68). 

15 Such microscopic evaluations of tissue samples is complex and difficult. 

Moreover, no "gold-standard" exists for defining neuroendocrine differentiation 
(Carnaghi, C. etal (2001) "Clinical Significance Of Neuroendocrine 
Phenotype In Non-Small-Cell Lung Cancer" AnnOncol 72:S119-23). The 
absence of an effective diagnostic standard complicates the management and 

20 treatment of neuroendocrine tumors (Oberg, K. (2001) "Chemotherapy And 
Bio-therapy In The Treatment Of Neuroendocrine Tumours," Ann Oncol 
/2:Slll-4). 

Researchers have attempted to apply the principles of molecular biology in 
order to identify molecular markers that would facilitate the diagnosis of 

25 neuroendocrine tumor types (see, for example, Japanese Patent Document JP 

58,198,758A2; and United States Patents Nos. 5,766,888; 5,856,097; 5,866,323; 
5,965,362; 5,976,790; 5,985,240; 5,998,154; 6,132,724; 6,166,176; 6,180,082; 
6,225,049; 6,238,877; 6,251,586; 6,335,167; and 6,358,491). Certain proteins, 
such as chromogranin A (CgA) and neuron-specific enolase (NSE) have been 

30 identified as having specific potential use in the clinical diagnosis of 
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neuroendocrine tumors (Seregni, E. etal (2000) "LABORATORY TESTS FOR 
Neuroendocrine Tumours" QJNuclMed. 44:22-41). Non-SCLC 
neuroendocrine tumors have been reported to overexpress CgA whereas SCLC 
tumors exhibit elevated NSE levels. Id. Lui 5 W.-O. et al (2001) "High Level 
5 Amplification Of 1 P32-33 And 2p22-24 In Small Cell Lung Carcinomas" 
Intl. J Oncol 7P:451-457 used comparative genomic hybridization analysis to 
identify chromosomal abnormalities in SCLC tumor cells. Through such analysis, 
several genetic regions were found to be amplified (i.e., Ip32, 2p23, lp32, and 
2p32). A loss of heterozygosity (LOH) is observed on 3p, 13q and 17p in nearly 

1 0 all SCLC tumors (Yokota et al (1 987) "Loss Of Heterozygosity On 

Chromosomes 3, 13 And 17 In Small Cell Carcinoma And On Chromosome 
3 In Adenocarcinoma Of The Lung" Proc. Natl Acad. Set (U.SA.) 84:9252- 
9256. Similarly, deletions in 1 lq have been correlated with the presence of AC 
and TC tumors (Walch, A.K. et al (1998) "Typical And Atypical Carcinoid 

1 5 Tumors Of The Lung Are Characterized B y 1 1 q Deletions As Detected By 
Comparative Genomic Hybridization" Am J Pathol 153:1 089-98). 

While such efforts have succeeded in identifying quantitative differences in 
mutations affecting various genes (for example finding that p53 is inactivated in 
>90% of SCLC tumors, but in only >50% of non-SCLC tumors, or that pi 6 
20 abnormalities arise in <1% of SCLC tumors but in -66% of non-SCLC tumors), 
clear correlations that would support a definitive differential diagnosis of tumor 
type has not been fully achieved (see, Ignacio, I. et al (2001) "MOLECULAR 
Genetics Of Small Cell Lung Carcinoma" Semin Oncol 25:3-13; Carnaghi, C. 
et al (2001) "Clinical Significance Of Neuroendocrine Phenotype In Non- 
25 Small-Cell Lung Cancer" Ann Oncol J2:S 1 1 9-23). In this regard, one recent 
study found no statistically significant correlation between any individual marker 
and response to chemotherapy for non-SCLC tumors (Gajra, A. etal (2002) "The 
Predictive Value Of Neuroendocrine Markers And p53 For Response To 
Chemotherapy And Survival In Patients With Advanced Non-Small Cell 
30 Lung Cancer" Lung Cancer. 36:1 59-65). Thus, a need remains for a usable 



molecular marker approach that could distinguish between the different types of 
neuroendocrine tumors. 

cDNA microarrays have been employed to analyze gene expression 
patterns in human cancers (DeRisi, J. et al (1996) "USE OF A CDNA 
MICROARRAY TO ANALYSE GENE EXPRESSION PATTERNS IN HUMAN CANCER" 
Nature Genetics 14:457-60). Such efforts have combined DNA amplification 
technologies (such as T7-based RNA amplification) with cDNA microarrays in 
order to improve the discriminating power of the analysis (Luo, L. et al (1999) 
"Gene Expression Profiles Of Laser-Captured Adjacent Neuronal 
Subtypes" Nature Medicine 5:1 17-22; Bonner, R.F. etal (1997) "LASER 
Capture Microdissection: Molecular Analysis Of Tissue" Science 
275:1481,1483; Schena, M. etal (1995) "Quantitative Monitoring Of Gene 
Expression Patterns With A Complementary DNA Microarray" Science 
270:467-70). 

Despite all such progress, no fully successful method for distinguishing 
between the neuroendocrine tumor types, and of thus accurately diagnosing 
neuroendocrine cancers has yet been achieved. The present invention is, in part, 
directed to such needs. 

Summary of the Invention: 

This invention relates to methods and compositions for the diagnosis of 
neuroendocrine lung cancers. The present invention permits one to accurately 
classify pulmonary neuroendocrine tumors based on their genome-wide expression 
profile without further manipulation. A hierarchical clustering of all genes 
classifies these tumors according to World Health Organization (WHO) 
histological type. The selection of genes with significant variance resulted in the 
identification of 198 transcripts, many of which have potentially important 
biological and clinical implications. The present invention thus defines and 
provides groups of genes that identify each tumor type, and permits one to derive a 
molecular signature that distinguishes each subtype of neuroendocrine tumor. 



In detail, the invention provides a method for determining whether a 
candidate cell is a neuroendocrine tumor cell, wherein the method comprises the 
steps of: 

(A) determining the profile of expression of a plurality of genes of the 
candidate cell; and 

(B) comparing such determined profile of expression with the profile of 
expression of the genes of a small cell lung cancer cell, a large cell 
neuroendocrine carcinoma cell, a typical carcinoid tumor cell or an atypical 
carcinoid tumor cell; 

to thereby determine whether the candidate cell is a neuroendocrine tumor cell. 

The invention particularly concerns the embodiment of such method 
wherein the method additionally permits a determination of neuroendocrine tumor 
cell type. The invention further concerns the embodiments of such methods 
wherein the method determines whether the candidate cell is a small cell lung 
cancer (SCLC) neuroendocrine tumor cell, a large cell neuroendocrine carcinoma 
(LCNEC) neuroendocrine tumor cell, a typical carcinoid (TC) neuroendocrine 
tumor cell, or an atypical carcinoid (AC) neuroendocrine tumor cell. 

The invention particularly concerns the embodiments of such methods 
wherein the plurality of genes includes one or more genes selected from the group 
consisting of C5, CPE, GRIA2, RIMS2, ORC4L, CSF2RB, GGH, NPAT, NR3C1, 
P31 1, PRKAA2, PTK6, APRT, ARF4L, ARHGD1A, ARL7, ATP6F, CDC20, 
CDC34, CLDN1 1, COMT, CSTF1, DDX28, JDHCR7, ERP70, FEN1, GCN1L1, 
GNB1, GUK1, HDAC7A, ITPA, JUP, K1AA0469, KRT5, PDAP1, PGAM1, PHB, 
POLA2, POLD2, POLE3, PYCR1, SIP2-28, SIVA, SURF 1, TAD A3 L, TK1, 
TYMSTR, and VATI, and especially wherein the plurality of genes includes one or 
more genes selected from the group consisting of GGH and CPE. 

The invention further concerns the embodiments of such methods wherein 
step (A) of the methods comprise incubating RNA of the candidate cell, or DNA or 
RNA amplified from such RNA, in the presence of a plurality of genes, or 
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fragments or RNA transcripts thereof, under conditions sufficient to cause RNA to 
hybridize to complementary DNA or RNA molecules; and detecting hybridization 
that occurs. 

The invention additionally concerns the embodiments of such methods 
5 wherein the plurality of genes, or polynucleotide fragments or RNA transcripts 
thereof, are distinguishably arrayed in a microarray. The invention particularly 
concerns the embodiments of such methods wherein the microarray comprises 
arrayed genes, or polynucleotide fragments or RNA transcripts thereof, that are 
differentially expressed in neuroendocrine tumor cells relative to normal cells. 

10 The invention particularly concerns the embodiments of such methods 

wherein the microarray comprises arrayed genes, or polynucleotide fragments or 
RNA transcripts thereof, that are differentially expressed in small cell lung cancer 
(SCLC) neuroendocrine tumor cells relative to large cell neuroendocrine 
carcinoma (LCNEC) neuroendocrine tumor cells. 

15 The invention particularly concerns the embodiments of such methods 

wherein the microarray comprises arrayed genes, or polynucleotide fragments or 
RNA transcripts thereof, that are differentially expressed in small cell lung cancer 
(SCLC) neuroendocrine tumor cells relative to typical carcinoid (TC) 
neuroendocrine tumor cells. 

20 The invention particularly concerns the embodiments of such methods 

wherein the microarray comprises arrayed genes, or polynucleotide fragments or 
RNA transcripts thereof, that are differentially expressed in small cell lung cancer 
(SCLC) neuroendocrine tumor cells relative to atypical carcinoid (AC) 
neuroendocrine tumor cells. 

25 The invention particularly concerns the embodiments of such methods 

wherein the microarray comprises arrayed genes, or polynucleotide fragments or 
RNA transcripts thereof, that are differentially expressed in large cell 
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neuroendocrine carcinoma (LCNEC) neuroendocrine tumor cells relative to 
atypical carcinoid (AC) neuroendocrine tumor cells. 

The invention particularly concerns the embodiments of such methods 
wherein the microarray comprises arrayed genes, or polynucleotide fragments or 
5 RNA transcripts thereof, that are differentially expressed in large cell 

neuroendocrine carcinoma (LCNEC) neuroendocrine tumor cells relative to typical 
carcinoid (TC) neuroendocrine tumor cells. 

The invention particularly concerns the embodiments of such methods 
wherein the arrayed genes, or polynucleotide fragments or RNA transcripts 

10 thereof, include one or more genes selected from the group consisting of C5, CPE, 
GRIA2, RIMS2, ORC4L, CSF2RB, GGH, NPAT, NR3CI, P3 11, PRKAA2, 
PTK6, APRT, ARF4L, ARHGD1A, ARL7, ATP6F, CDC20, CDC34, CLDN1 1, 
COMT, CSTF1, DDX28, DHCR7, ERP70, FEN1, GCN1L1, GNB1, GUK1, 
HDAC7A, ITPA, JUP, K1AA0469, KRT5, PDAP1, PGAM1, PHB, POLA2, 

15 POLD2, POLE3, PYCR1, SIP2-28, SIVA, SURF 1, TAD A3 L, TK1, TYMSTR, 
and VATI,. 

The invention especially concerns the embodiments of such methods 
wherein the arrayed genes, or polynucleotide fragments or RNA transcripts 
thereof, include one or more genes selected from the group consisting of GGH and 
20 CPfi, or polynucleotide fragments or RNA transcripts thereof. 

The invention particularly concerns the embodiments of such methods 
wherein the microarray comprises arrayed genes, or polynucleotide fragments or 
RNA transcripts thereof, that are differentially expressed in large cell 
neuroendocrine carcinoma (LCNEC) neuroendocrine tumor cells relative to 
25 atypical carcinoid (AC) neuroendocrine tumor cells. 

The invention particularly concerns the embodiments of such methods 
wherein the microarray comprises arrayed genes, or polynucleotide fragments or 
RNA transcripts thereof, that are differentially expressed in typical carcinoid (TC) 
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neuroendocrine tumor cells relative to atypical carcinoid (AC) neuroendocrine 
tumor cells 

The invention additionally concerns a microarray of genes, or 
polynucleotide fragments or RNA transcripts thereof for distinguishing a 
5 neuroendocrine tumor cell, the microarray comprising a solid support having 
greater than 10 genes, or polynucleotide fragments or RNA transcripts thereof, 
distinguishably arrayed in spaced apart regions, wherein the microarray comprises 
a sufficient number of genes, or polynucleotide fragments or RNA transcripts 
thereof, that are differentially expressed in a small cell lung cancer (SCLC) cell, a 
10 large cell neuroendocrine carcinoma (LCNEC) neuroendocrine tumor cell, a 

typical carcinoid (TC) neuroendocrine tumor cell, or an atypical carcinoid (AC) 
neuroendocrine tumor cell, relative to a normal cell or a cell belonging to a 
different neuroendocrine tumor cell type, to permit the microarray to distinguish a 
pulmonary neuroendocrine tumor cell. 

( 

1 5 The invention particularly concerns the embodiment of such microarray 

wherein the microarray comprises a sufficient number of genes, or polynucleotide 
fragments or RNA transcripts thereof, that are differentially expressed in a 
neuroendocrine tumor cell relative to a normal cell to permit the microarray to 
distinguish between a neuroendocrine tumor cell and a normal cell. 

20 The invention particularly concerns the embodiments of such microarrays 

wherein the microarray comprises a sufficient number of genes, or polynucleotide 
fragments or RNA transcripts thereof, that are differentially expressed in a small 
cell lung cancer (SCLC) neuroendocrine tumor cell relative to a large cell 
neuroendocrine carcinoma (LCNEC) neuroendocrine tumor cell to permit the 

25 microarray to distinguish between a small cell lung cancer (SCLC) neuroendocrine 
tumor cell and a large cell neuroendocrine carcinoma (LCNEC) neuroendocrine 
tumor cell 

The invention particularly concerns the embodiments of such microarrays 
wherein the microarray comprises a sufficient number of genes, or polynucleotide 
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fragments or RNA transcripts thereof, that are differentially expressed in a small 
cell lung cancer (SCLC) neuroendocrine tumor cell relative to a typical carcinoid 
(TC) neuroendocrine tumor cell to permit the microarray to distinguish between a 
small cell lung cancer (SCLC) neuroendocrine tumor cell and a typical carcinoid 
5 (TC) neuroendocrine tumor cell. 

The invention particularly concerns the embodiments of such microarrays 
wherein the microarray comprises a sufficient number of genes, or polynucleotide 
fragments or RNA transcripts thereof, that are differentially expressed in a small 
cell lung cancer (SCLC) neuroendocrine tumor cell relative to an atypical carcinoid 
10 (AC) neuroendocrine tumor cell to permit the microarray to distinguish between a 
small cell lung cancer (SCLC) neuroendocrine tumor cell and an atypical carcinoid 
(AC) neuroendocrine tumor cell. 

The invention particularly concerns the embodiments of such microarrays 
wherein the microarray comprises a sufficient number of genes, or polynucleotide 
15 fragments or RNA transcripts thereof, that are differentially expressed in a large 
cell neuroendocrine carcinoma (LCNEC) neuroendocrine tumor cell relative to a 
typical carcinoid (TC) neuroendocrine tumor cell to permit the microarray to 
distinguish between a large cell neuroendocrine carcinoma (LCNEC) 
neuroendocrine tumor cell and a typical carcinoid (TC) neuroendocrine tumor cell. 

20 The invention particularly concerns the embodiments of such microarrays 

wherein the microarray comprises a sufficient number of genes, or polynucleotide 
fragments or RNA transcripts thereof, that are differentially expressed in a large 
cell neuroendocrine carcinoma (LCNEC) neuroendocrine tumor cell relative to an 
atypical carcinoid (AC) neuroendocrine tumor cell to permit the microarray to 

25 distinguish between a large cell neuroendocrine carcinoma (LCNEC) 

neuroendocrine tumor cell and an atypical carcinoid (AC) neuroendocrine tumor 
cell. 

The invention particularly concerns the embodiments of such microarrays 
wherein the microarray comprises a sufficient number of genes, or polynucleotide 
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fragments or RNA transcripts thereof, that are differentially expressed in a typical 
carcinoid (TC) neuroendocrine tumor cell relative to an atypical carcinoid (AC) 
neuroendocrine tumor cell to permit the microarray to distinguish between a 
typical carcinoid (TC) neuroendocrine tumor cell and an atypical carcinoid (AC) 
5 neuroendocrine tumor cell. 

The invention particularly concerns the embodiments of such microarrays 
wherein the genes or polynucleotide fragments or RNA transcripts thereof of the 
microarray include one or more genes selected from the group consisting of C5, 
CPE, GRIA2, RIMS2, ORC4L, CSF2RB, GGH, NPAT, NR3C1, P311, PRKAA2, 
10 PTK6, APRT, ARP4L, ARHGD1A, ARL7, ATP6F, CDC20, CDC34, CLDN1 1, 
COMT, CSTF1, DDX28, DHCR7, ERP70, FEN1, GCN1L1, GNB1, GUK1, 
HDAC7A, ITPA, JUP, K1AA0469, KRT5, PDAP1, PGAM1, PHB, POLA2, 
POLD2, POLE3, PYCR1, SIP2-28, SIVA, SURF 1, TADA3L, TK1, TYMSTR, 
and VATI, or a polynucleotide fragment or RNA transcript thereof. 

15 The invention further concerns the embodiments of such microarrays 

wherein the genes or polynucleotide fragments or RNA transcripts thereof of the 
microarray include one or more genes selected from the group consisting of GGH 
and CPE, or a polynucleotide fragment or RNA transcript thereof. 

Brief Description of the Figures: 

20 Figure 1 shows the hierarchical clustering of genes with statistically 

significant variance (p<0.004) among all tumor samples. 

Figure 2 shows the hierarchical clustering of 198 genes, created by 
enforcing the classification of 17 tumors. 

Figures 3A and 3B show the expression of genes of large cell 
25 neuroendocrine tumor cells and typical carcinoid tumor cells. 

Figure 4 shows a dendrogram of pulmonary NE tumors based on 
expression of 198 genes. Seventeen cases of the NE tumors were sorted by one- 
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way hierarchical clustering based on the expression similarities of 198 genes that 
were selected from 9,984 genes based on the expression changes in the three 
subtype tumors with significant statistical difference (F-test, p<0.004). Medium 
gray, light gray, and black signal indicate that expression of these genes is higher, 
lower or equal to the median level of expression in all samples, respectively. 
White represents missing genes or poor quality data. TC: typical carcinoid; SC: 
small cell lung cancer; LC: large cell neuroendocrine carcinoma; SC+LC: a tumor 
sample with 90% SC and 10% LC. The numbers are the case numbers of the tumor 
samples. 

Figures 5A, 5B, 5C, 5D, 5E, and 5F show comparisons of expression 
changes detected by microarrays and real-time quantitative RT-PCR. RNA isolated 
from LCM cells was examined in triplicates for expression of three representative 
genes upregulated in each tumor subtype. The gene expression changes detected 
by real-time RT-PCR (Figure 5A-C) were presented here in comparisons with 
those derived from cDNA microarray analysis (Figure 5D-F). The expression of 
each gene in the RT-PCR analysis was normalized first by expression of the 18S 
ribosomal gene in the same cell line and then by the expression of that gene in the 
BEAS-2B control cells. CPE: carboxypeptidase E; P3 1 1 : a gene of neuronal 
marker; CDC20: human homolog gene for S. cerevisiae cell division cycle 20 
gene. TC: typical carcinoid; SC: small cell lung cancer; LC: large cell 
neuroendocrine carcinoma. The 17 pulmonary NET cases were arranged from left 
to right in each panel in the same order of 1240, 1672, 1 1 169, 1 1934, 12454, 
12878, 890, 1047, 11061, 12346, 12457, 12893, 13369, 10110, 10249, 10373, and 
12700. The primer pairs for RT-PCR are: CPE: (SEQ ID NO:2) 5'- 
TTGTCCGAGACCTTCAAGGTAAC-3' and (SEQ ID NO:3) 5'- 
CCTTTGCGGATGTAACATCGT-3 t ; P311: (SEQ ID NO:4) 5'- 
TGGGTCAGTCAAGAACCATTTC-3' and (SEQ ID NO:5) 5'- 
ACTTCCTTTGGGACAGGAAGTCT-3'; and CDC20: (SEQ ID NO:6) 5'- 
CTGAACGGTTTTGATGTAGAGGAA-3' and (SEQ ID NO:7) 5*- 
CCCTCTGGCGCATTTTGT-3'. 



-13- 



Figures 6A and 6B show the results of Kaplan-Meier Survival rates of 54 
cases of pulmonary NET patients as function of CPE or GGH expression. Figure 
6A shows the survival rates of patients with positive and negative CPE stains on 
pulmonary NET cells. The survival rate (76%) for the patients with the positive 
CPE are statistically significant (p=0.023) higher than that (27%) with the negative 
stain. Figure 6B shows the inverse correlation of the survival rates to the GGH 
expression in pulmonary NET cells. The survival rates to positive and negative 
GGH stains in pulmonary NET cells were 28% and 83%, respectively, with the 
statistic significance (p=0.0035). X indicates censored samples. 

Description of the Preferred Embodiments: 

The invention concerns methods and compositions for the diagnosis of 
neuroendocrine lung cancers. Lung cancer is a leading cause of cancer-related 
deaths (Franceschi, S. et al (1999) "The Epidemiology Of Lung Cancer," Ann. 
Oncol. 10 Suppl 5:S3-6). Pulmonary neuroendocrine tumors (NETs) account for 
20-30% of lung cancer cases and lung cancer is the leading cause of cancer-related 
death (Parkin, D M. et al (1999) "Global Cancer Statistics," CA Cancer J 
Clin 49:33-64, 1). The observed continuous relative increase in the incidence of 
SCLC (Junker, K. et al (2000) "Pathology of Small-Cell Lung Cancer," J. 
Cancer Res. Clin. Oncol. 126:361-368) reflects cigarette smoking, lack of effective 
methods for early diagnosis and inadequate predictive markers of aggressive lung 
cancer types. 

Pulmonary NETs include low-grade typical carcinoid (TC), intermediate- 
grade atypical carcinoid (AC), and high-grade large cell neuroendocrine carcinoma 
(LCNEC) and small cell lung cancer (SCLC) (Travis, W.D. et al (1998) 
"Reproducibility Of Neuroendocrine Lung Tumor Classification," Hum 
Pathol. 29:272-279). TC, AC and LCNEC collectively comprise only 3%-5% of all 
pulmonary malignancies, whereas SCLC accounts for 15%-25% (Travis, W.D. et 
al (1998) "Reproducibility Of Neuroendocrine Lung Tumor 
Classification," Hum Pathol. 29:272-279; Travis, W.D. etal (1991) 
"Neuroendocrine Tumors Of The Lung With Proposed Criteria For Large- 
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Cell Neuroendocrine Carcinoma. An Ultrastructural, 
Immunohistochemical, And Flow Cytometric Study Of 35 Cases," Am J 
Surg Pathol. 15:529-553). The prognostic relevance of pulmonary NETs has 
changed significantly since the recent recognition of the LCNEC subtype (Travis, 
5 W.D. et al ( 1 998) "REPRODUCIBILITY OF NEUROENDOCRINE LUNG TUMOR 

Classification," Hum Pathol. 29:272-279; Travis, W.D. etal (1991) ", 
••Neuroendocrine Tumors Of The Lung With Proposed Criteria For Large- 
Cell Neuroendocrine Carcinoma. An Ultrastructural, 
Immunohistochemical, And Flow Cytometric Study Of 35 Cases," Am J 

10 Surg Pathol. 15:529-553). The 5- and 10-year survival rates for TC are 87% and 
87%, for AC are 56% and 35%, for LCNEC are 27% and 9%, and for SCLC are 
9% and 5%, respectively. Pulmonary NETs have a similar morphologic appearance 
with organoid, trabecular or rosette-like pattern, and the immunohistochemical 
staining for neuroendocrine markers: chromogranin, synaptophysin, and neural cell 

15 adhesion molecule (NCAM, CD56). To distinguish these tumors from non-small 
cell lung cancers (NSCLC), some cases are analyzed by electron microscopy for 
the presence of neuroendocrine granules. Prior to the present invention, no 
specific molecular markers had been identified that could distinguish subtypes of 
pulmonary NETs and, other than clinical stage at presentation, the tumor mitotic 

20 index is the only independent histologic predictor of survival. The current 

treatment for patients with TC and AC is surgical resection, because these tumors 
grow slowly and are frequently detected as solitary pulmonary lesions. In contrast, 
surgical resection is feasible in less than one third of the LCNEC patients, with or 
without neoadjuvant treatment. Unfortunately, at the time of diagnosis, most 

25 SCLC are disseminated and prognosis is poor. Thus, accurate diagnosis of the 
pulmonary NET subtypes is essential for appropriate treatment and prediction of 
clinical outcome (Travis, W.D. et al (1998) "Survival Analysis Of 200 
Pulmonary Neuroendocrine Tumors With Clarification Of Criteria For 
Atypical Carcinoid And Its Separation From Typical Carcinoid," Am J 

30 Surg Pathol. 22:934944; Zacharias, J. et al (2003) "LARGE Cell 
Neuroendocrine Carcinoma And Large Cell Carcinomas With 



WO 2004/041196 



PCT/US2003/034787 



-15- 

Neuroendocrine Morphology Of The Lung: Prognosis After Complete 
Resection And Systematic Nodal Dissection," Ann. Thome. Surg. 75:348- 
352). 

Neuroendocrine tumors are a distinct subset of lung cancers that share 
5 morphologic, ultrastructural, immunohistochemical, and molecular characteristics. 
As indicated above, the term neuroendocrine tumors encompasses small cell lung 
cancer (SCLC) tumors, large cell neuroendocrine carcinomas, typical carcinoid 
(TC) tumors and atypical carcinoid (AC) tumors. All neuroendocrine tumors have 
similar morphologic appearance with organoid, trabecular or rosettelike pattern; 
10 immunohistochemical staining for chromogranin (Cga), synaptophysin, neuron- 
specific enolase (NSE), neural cell adhesion molecule (NCAM), and the presence 
of neuroendocrine granules, which can be detected by electron microscopy (Fisher, 
E.R. eta!. (1978) "COMPARATIVE HISTOPATHOLOGIC, HlSTOCHEMICAL, ELECTRON 

Microscopic And Tissue Culture Studies Of Bronchial Carcinoids And 
15 Oat Cell Carcinomas Of The Lung," Am J Clin Pathol 69: 165-172). 

The dramatic differences in survival exhibited by the different 
neuroendocrine malignancies reflect fundamental differences in biological 
behavior and therapeutic approaches in these tumors (Travis, W.D., et al (1998) 
"Survival Analysis Of 200 Pulmonary Neuroendocrine Tumors: With 

20 Clarification Of Criteria For Atypical Carcinoid And Its Separation 

From Typical Carcinoid," Am J Surg Pathol 22:934-944). Current treatment for 
patients with TC involves surgical resection because the tumors are slow growing 
and frequently detected as solitary pulmonary lesions. In less than one third of 
patients with LCNEC, surgical resection is possible without neoadjuvant treatment. 

25 Unfortunately, at the time of diagnosis, most SCLC tumors are disseminated, 
treatment is not effective and the prognosis is poor. Thus, accurate diagnosis of 
each type of pulmonary neuroendocrine tumors is essential for successful clinical 
outcome. 
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The combined use of light microscopy, immunohistochemistry and electron 
microscopy has increased the oncologist's ability to differentiate different subtypes 
of neuroendocrine tumors and has provided clues regarding their pathogenesis. 
However, little information is available on genetic changes associated with each 
type of neuroendocrine tumors. 

Over the past decade, there have been significant changes in the 
classification of pulmonary neuroendocrine tumors in order to improve prediction 
of their biological behavior. The accurate diagnosis of each pulmonary tumor 
subtype is critical for decisions of therapy. A diagnosis based on light microscopic 
examination, specifically in differentiation of SCLC from other pulmonary NETs 
is often challenging. Unfortunately, there are no molecular markers to aid in 
differentiation of each tumor subtype. 

In accordance with the methods of the present invention, the analysis of 
genome-wide gene expression in neuroendocrine tumors from cDNA microarray 
data (often referred to as "unsupervised learning") accurately distinguishes each 
tumor type. The pattern of gene expression has been found to correlate with each 
subtype assigned by light microscopy according to WHO/LASLSC classification 
(Histopathological classification of these tumors is based on the 1999 WHO 
Classification (Travis, W.D. et al (1999) "Histologic Typing Of Lung And 
Pleural Tumors" (Ed 3). Berlin, Germany, Springer). 

Microarray technology is widely used to identify changes in gene 
expression accompanying altered cell physiology during development, cell cycle 
progression, drug treatment or disease progression. Related phenotypes are usually 
accompanied by related patterns of cellular transcripts that can be used to 
characterize these changes. The present invention exploits the recent development 
of DNA microarray technology (see, for example, DeRisi, J. et al (1996) "USE Of 
A cDNA Microarray To Analyse Gene Expression Patterns In Human 
Cancer" Afa/wre Genetics 14:457-60; Luo, L. et al (1999) "Gene Expression 
Profiles Of Laser-Captured Adjacent Neuronal Subtypes" Nature 
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Medicine 5:1 17-22; Bonner, R.F. et al (1997) "Laser Capture 
Microdissection: Molecular Analysis Of Tissue" Science 2 78: 1 48 1 , 1 483 ; 
Schena, M. et al (1 995) "Quantitative Monitoring Of Gene Expression 
Patterns With A Complementary DNA Microarray" Science 270:467-70) to 
analyze genome-wide changes that may distinguish these tumors and discover 
molecular markers. The identification of such markers and their subsequent use 
ion the diagnosis and monitoring of neuroendocrine cancers permits a more 
effective selection of treatment modalities for individual patients. 

The analysis of changes in gene expression in clinical specimens is 
complicated by the mixture of tumor and normal cells, as well as stromal, vascular, 
and other cells obtained in biopsy. In addition, the heterogeneity of cell type 
hinders the study of gene expression profiles in cancer cells. Although the 
principles of the present invention may be used with tissue biopsies and other 
tissue samples, most preferably, the analysis will be conducted with single cells. 
Such single cells can be isolated using any of a variety of methods, however, the 
use of laser capture microdissection (LCM) is preferred. Laser capture 
microdissection is a procedure that permits the harvesting of a specific cell 
population directly from frozen sections. The procedure involves fixing the 
desired cells to a thermoplastic film following infrared laser pulse to avoid 
"contamination" by other cell populations (Emmert-Buck, M.R. etal (1996) 
"Laser Capture Microdissection," Science 274:998-1001; Goldsworthy, S.M. et al 
(1999) "Effects Of Fixation On RNA Extraction And Amplification From 
Laser Capture Microdissected Tissue," Molecular Carcinogenesis, 1999, 86- 
91; Luo, L. et al (1999) "Gene Expression Profiles Of Laser-Captured 
Adjacent Neuronal Subtypes" Nature Medicine 5:1 17-22). 

Most preferably, the PixCell™ LCM system (Arcturus, Moutain View, CA) 
is used for laser capture microdissection (Bonner, R.F., etal (1997) "Laser 
Capture Microdissection: Molecular Analysis Of Tissue," Science 278: 
1481,1483). The examples described below illustrate the desirability of isolating 
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tumor cells from vascular and inflammatory components frequently found in 
surgical specimens of lung cancer and other vascular tumors. 

The present invention thus permits one to distinguish between different 
neuroendocrine tumor subtypes based on their expression profiles. Preferably, 
5 such analysis will involve a comparison of the expression of multiple genes, and is 
accomplished by assessing the extent or presence of hybridization occurring 
between RNA transcripts (or cDNA copies thereof) of a candidate cell and genes, 
or polynucleotide fragments or RNA transcripts thereof of a reference cell that are 
differentially expressed in some or all neuroendocrine tumor cells. As used herein, 

1 0 a gene is said to be "differentially expressed" in a tumor cell if detection of its 
expression facilitates the determination that a candidate cell is or is not a tumor 
cell. As used herein, the term "polynucleotide fragment" refers to a polynucleotide 
that is either a portion of a gene, cDNA or RNA molecule, or a complement of 
such molecules, and which possesses a length of at least 10 nucleotide residues, at 

15 least 15 nucleotide residues, at least 20 nucleotide residues, at least 25 nucleotide 
residues, at least 35 nucleotide residues, at least 50 nucleotide residues, at least 75 
nucleotide residues, at least at least 100 nucleotide residues, at least 150 nucleotide 
residues, or at least 200 nucleotide residues. 

Clones containing suitable genes, and from which suitable polynucleotide 
20 fragments or RNA transcripts can be made, are obtainable from Incyte Genomics 
(www.incvte.comy The present invention provides a preferred set of 198 genes 
that are particularly suited for use in such analysis. Clones of these genes are 
commercially available from Incyte Genomics using the Incyte Clone ID No. 
information provided in Table 2. Preferably the analysis will be conducted using 
25 10%, 20%, 50%, 70%, 80%, 90% or all of these 198 genes, alone or in 

combination with other genes, or polynucleotide fragments or RNA transcripts 
thereof. These 198 genes have been found to define three different cluster groups. 
The analysis may involve a comparison of the expression of genes belonging to the 
same cluster group, or to two or more different cluster groups. 
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cDNA microarrays are preferably performed on a solid surface, such as a 
chip or slide. Preferably, such surfaces will contain multiple human genes, or 
polynucleotide fragments or RNA transcripts thereof, distinguishably arrayed. As 
used herein, the term "distinguishably arrayed" is intended to denote that such 
gene's (or its fragment or transcript's location on the surface is distinct or 
distinguishable from the locations of other gene(s) that may be bound to the 
support. 

Most preferably, the array will contain gene fragments of hundreds or 
thousands of human genes. A glass slide containing gene fragments of 9,984 
human genes (provided by the Advanced Technology Center of the National 
Cancer Institute) is preferably employed. Clones and arrays are also available 
from Incyte Genomics, Palo Alto, CA, and other sources. 

For analyzing such microarrays, nucleic acid, most preferably RNA, is 
isolated from candidate neuroendocrine cells. Any of a wide variety of 
amplification procedures may be employed. In a preferred embodiment of the 
invention, a T7-based RNA amplification procedure ins employed, such as that 
described by Luo, L. et al (1999) ("Gene Expression Profiles Of Laser- 
Captured Adjacent Neuronal Subtypes" Nature Medicine 5: 1 1 7-22). To 
facilitate the analysis, the amplified material is preferably labeled, as with a 
radioactive, fluorescent, chemiluminescent, enzymatic, haptenic, or other label, 
and incubated with the arrayed gene fragments under conditions suitable for 
nucleic acid hybridization to occur (see, for example, Schena, M. et al (1995) 
"Quantitative Monitoring Of Gene Expression Patterns With A 
Complementary DNA Microarray" Science 270:467-70). 

The hybridized array are then analyzed for their pattern of hybridization. 
Detection of hybridization, e.g., detection of the labeled amplified material 
hybridized to a region of the array, indicates that the gene present at such region 
was expressed by the candidate cell being analyzed. Most preferably, such 
analysis will employ an automated scanning device, such as a GenePix 4000A 
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Laser Scanner (Axon Instruments, Inc., Foster City, CA ) in conjunction with 
software for conducting such analysis. The BRB ArrayTools (ver 2.0) is preferred 
for this purpose fhttp://linus.nci.nih.gov/BRB-ArravTools.htmn . 

Having now generally described the invention, the same will be more 
5 readily understood through reference to the following examples, which are 

provided by way of illustration, and are not intended to be limiting of the present 
invention, unless specified. 

Example 1 
cDNA Microarray ' 

10 In order to identify molecular markers of pulmonary neuroendocrine 

tumors, the gene expression profile of clinical samples from patients with TC, 
LCNEC, and SCLC is analyzed by cDNA microarrays, preferably as follows: 

Tissue Collection And RNA Quality Assessment. Archived, frozen lung 
tumor tissues are collected from hospitals over an 1 1 year period. Tumor tissues 

15 are flash-frozen at surgery and stored at -80°C until used. The frozen tumor tissue 

block is prepared with O.C.T. mount medium and the quality of total RNA of each ( 
sample is evaluated by spectrophotometer and gel electrophoresis after 
phenol/chloroform extraction from one frozen section. Histopathological 
classification of these tumors is based on the 1999 WHO Classification (Travis, 

20 W.D. et al (1 999) "HISTOLOGIC TYPING OF LUNG AND PLEURAL TUMORS" (Ed 3). 
Berlin, Germany, Springer). Two large cell neuroendocrine carcinomas (case 1240 
and 1672) are confirmed by demonstrating the neuorendocrine immuno-phenotype 
with positive NCAM (CD56) staining. Table 1 summarizes clinical findings in the 
pulmonary NE tumors. 
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Table 1 



Clinical Features Of 17 Patients With Pulmonary Neuroendocrine Tumors 



Histology 




Sex 


Age 


Smoking 






Male 


Female 


Range 


Mean 




TC 


(n-11) 


7 


4 


35-68 


50 


7 (64%) 


LCNEC 


(n=2) 


2 


0 


59-60 


60 


2 (100%) 


SCLC 


(n=4) 


3 


1 


43-75 


65 


4 (100%) 


1 TOTAL 


(n=17) 


12 


5 


35-75 


65 


13 (100%) 



Laser Capture Microdissection Of 17 Neuroendocrine Tumors. Frozen 
tumor tissue (0.5 x 0.5 x 0.5 cm) are embedded in O.C.T. in a cryomold, and 
immersed immediately in dry ice-cold 2-methylbutane at-50°C. Sections of 
frozen tissue (8 mm) are mounted on silane coated glass slides and kept at -80°C 
5 until use. The slides are immediately fixed by immersion in 70% ethanol, stained 
with H&E and air-dried for 10 minutes after xylene treatment. 

The PixCell™ LCM system (Arcturus, Moutain View, CA) is used for 
LCM (Bonner, R.F., et al (1997) "Laser Capture Microdissection: 
Molecular Analysis Of Tissue," Science 278: 1481,1483). Tumor cells are 

10 fused to transfer film by thermal adhesion after laser pulse and were then 

transferred into tubes containing solution D in the Strategene Micro RNA isolation 
kit that contains gaunidinium thiocyanate (GTC) and beta-mercaptoethanol. For 
each specimen, 15 to 18 frozen sections are used to maximize the quantity of RNA. 
Total RNA is extracted using a Micro RNA isolation kit (Strategene, La Jolla, CA) 

15 according to the manufacturer's instructions. Purified total RNA was resuspended 
in 1 1 ml of diethyl pyrocarbonate (DEPC), treated water, and used directly for 
RNA amplification and subjected to cDNA microarray analysis (Schena, M. et al 
(1995) "Quantitative Monitoring Of Gene Expression Patterns With A 
Complementary DNA Microarray," Science 270(523 5):467-70; DeRisi, J. et 

20 al (1 996) "Use Of A cDNA Micro Array To Analyse Gene Expression 
Patterns In Human Cancer," Nature Genetics 14:457-60, Lyer, R.P. et al 
(1999) "Modified Oligonucleotides-Synthesis, Properties And 
Applications," Curr. Opin. Mol. Ther. 1:344-358). 
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RNA Amplification. The RNA amplification procedure used is preferably 
as described by Luo, L. et al. (l 999) ("Gene Expression Profiles Of Laser- 
Captured Adjacent Neuronal Subtypes," Nature Med 5: 1 17-122). The 
method relies on attaching a T7 promoter sequence to the oligo(dT) primer. A 
preferred such sequence for synthesis of the first strand cDNA is SEQ ID NO.-.l: 

5' TCTAGTCGAC GGCCAGTGAA TTGTAATACG ACTCACTATA 
GGGCGTTTTT TTTTTTTTTT TTTTTT 3 ' 

After second strand cDNA synthesis, amplified RNA is generated using T7 RNA 
polymerase and the double-stranded cDNA molecules as targets for the linear 
amplification. The T7 promoter sequence is regenerated in subsequent rounds by 
priming the first strand cDNA synthesis reaction with random hexamers 
(Amersham Biosciences, Piscataway, NJ). The quality and quantity of amplified 
RNA were evaluated spectrophotometricaly and by migration in 2.4% agarose gel 
electrophoresis, respectively. 

Cell Culture. BEAS-2B cell line (Amstad, P. etal. (1988) "NEOPLASTIC 
Transformation Of A Human Bronchial Epithelial Cell Line By A 
Recombinant Retrovirus Encoding Viral Harvey Ras," Mol Carcinog. 1988 
1:151-60) is cultured in a serum-free medium, LHC-9 (Biofluids, Rockville, MD). 
Total RNA is isolated from cells with Trizol, followed by phenol/chloroform and 
isopropanol extraction and purification (Stratagene, La Jolla, CA). Two rounds of 
amplified RNA are generated from the cell line as described above. 

Microarrays Hybridization. cDNA microarrays are performed in 
duplicate for each sample on glass slides containing 9,984 human genes which 
were provided by the Advanced Technology Center of the National Cancer 
Institute. BEAS-2B amplified RNA (8 ug) is labeled with Cy5-dUTP and test 
samples (4 mg each) are labeled with Cy3-dUTP using Superscript II (Invitrogen, 
Carlsbad, CA). Briefly, RNA is incubated with Cy3-dUTP (or Cy5-dUTP) (Perkin 
Elmer Life Sciences, Boston, MA) at 42°C for Ih to synthesize the first strand of 
cDNA. The reaction is stopped by addition of 5 pi 0.5M EDTA and 10 pi IN 
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NaOH followed by incubation at 65°C for 60 min. After neutralization, the 
samples are purified by centrifiigation with a Microcon 30 (Millipore Corp., 
Bedford, MA) to remove unincorporated nucleotides and salts. The Cy3- and Cy5- 
labeled samples of each pair are combined and heated at 100°C for 2 min. After 
centrifiigation for 10 minutes, the samples are placed onto the center of a glass 
microarray slide and hybridized at 65°C for 16h. The slides are washed to a final 
stringency of 0.2 x SSC at room temperature for 2 min prior to analysis. 

Image And Statistic Analysis. Hybridized array slides are scanned with a 
GenePix 4000A Laser Scanner (Axon Instruments, Inc., Foster City, CA ). 
Analysis is performed using BRB ArrayTools (ver 2.0) developed by Drs. Richard 
Simon and Amy Peng (http://linus.nci.nih.gov/BRB-ArravTools.htmn . 
Hierarchical clustering was performed on 8,987 clones with log-ratios present in at 
least 4 samples for each gene. 

Example 2 
cDNA Microarray Results 

The results of the microarray analysis are obtained using Laser Capture 
Microdissection (LCM) as follows: 

Laser Capture Microdissection (LCM) Of Clinical Samples. Use of 

LCM improves the sample preparation of microarray analysis by avoiding 
contamination with other cell types. (Emmert-Buck, M.R. et al (1996) "Laser 
Capture Microdissection," Science 274:998-1001). This selection is particularly 
desirable for analysis of tumors from lung, prostate, overy, and others (Ornstein, 
D.K. et al (2000) "Proteomic Analysis Of Laser Capture Microdissected 
Human Prostate Cancer And In Vitro Prostate Cell Lines," Electrophoresis 
21(1 1):2235-2242; Mirura, K. et al (2002) "LASER CAPTURE MICRODISSECTION 
And Microarray Expression Analysis Of Lung Adenocarcinoma Reveals 
Tobacco Smoking- And Prognosis Related Molecular Profiles," Cancer 
Res. 62:3244-3250; Ono, K. et al (2000) "Identification By CDNA 
Microarray Of Genes Involved In Ovarian Carcinogenesis," Cancer Res. 
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60:5007-501 1). Tumor cells are selected by LCM from frozen sections. High 
quality RNA is obtained from these dissected materials. . 

Microarray Analysis Of Gene Expression Profiles Of Pulmonary 
Neuroendocrine Tumors. The invention tested the hypothesis that gene 
5 expression profiling using cDNA microarray analysis can effectively identify 
subtypes of pulmonary neuroendocrine tumors classified by light microscopy 
according to WHO recommendations. Hierarchical clustering of 8,987 human 
genes, often referred to as unsupervised learning, separated samples into clusters 
based on overall similarity in gene expression without prior knowledge of sample 

10 identity. The hierarchical clustering of genes with statistically significant variance 
(p<0.004) among all tumor samples is displayed in Figure 1. After decoding the 
specimens, it was immediately apparent that clustering based on genome-wide 
expression divides the tumors into their assigned WHO classification with 100% 
accuracy. Tumor samples from TC, LCNEC and SCLC clusters with their 

15 respective subtype indicating similarities of gene expression shared by these 
tumors. The length of the branches indicates the relatedness of neuroendocrine 
tumors. Three distinct groups of tumors can be identified by this display. The 
sample, which contains features of LCNEC and SCLC clusters between LCNEC 
and SCLC with a shorter distance to SCLC. Thus, the data support the molecular 

20 classification that predicted morphological classification of human pulmonary 
neuroendocrine tumors. The data indicates that WHO proposed morphological 
classification of pulmonary neuroendocrine tumors correspond to a significant 
similarity of their molecular profiles. 

The Class Comparison Tool is used to select genes differentially expressed 
25 among each tumor type at an assigned statistical significance level. The F-test, 
which measures levels of variance in gene expression among each sample, is used 
to compare the defined classes of tumors using BRB ArrayTool. This analysis 
results in the identification of a set of 198 genes that have statistically significant 
variance (p<0.004, Table 2). Having selected these 198 genes, another 
30 hierarchical clustering can be created by enforcing the classification of 17 tumors 
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(Figure 2). The results show that the tumors cluster together in 3 groups in 
complete agreement with the pre-assigned morphological classification. Samples 
from LCNEC cluster closer to TC than to SCLC and the tumor that contained 
features of small and large neuroendocrine cells clustered with SCLC which 
5 confirms the molecular relatedness identified by genome-wide expression in 

clinical behavior of these tumors. The results show that most of the 198 selected 
genes could be assigned to major functional groups that have been previously 
implicated in cancer development (Table 3). In particular, decreased expression of 
genes that oppose cell survival pathway, such as BCL2 antagonist-killer, BAK1, 

1 0 and caspase 4, are common in all 3 types of neuroendocrine tumors, whereas TC 
and LCNEC have an additional >2.5-fold decrease in expression of BAS and TNF 
receptor-interacting kinase, RIPK1. These features indicate that these tumors lack 
opposing effects on BCL2, as contrasted to overexpression of BCL2, which leads 
to survival advantage in certain types of lymphomas (Cleary, M.L. etal (1986) 

1 5 "Cloning And Structural Analysis Of Cdnas For Bcl-2 And A Hybrid 

BCL-2/lMMUNOGLOBULIN TRANSCRIPT RESULTING FROM THE T( 1 4; 1 8) 

Translocation, 5 ' Cell. 47(1): 19-28) (Figure 3). 



Tahle 2 


Genes 


Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 

Cells 


Unique 
ID No. 


Description 


Gene Symbol 
(Map) 


Incyte 
Clone ID No. 


UG I 
Cluster 


Cluster #1 




166807 


glutamate receptor, ionotropic, 
AMPA 2 

Neuronal Marker, TM Receptor 


GRIA2 
[4q32-q33] 


IncytePD: 1505977 


Hs.89582 


159877 


carboxypeptidase E 
Secreted Lys Neuronal M 


CPE 
[4q32.3] 


lncytePD:21 53373 


Hs.75360 


161598 


origin recognition complex, subunit 
4 (yeast homolog)-like 


ORC4L 
[2q22-q23] 


lncytePD:2728840 


Hs.55055 


167158 


complement component 5 
Infl. Resp. VP. Extracellular 


C5 

[9q32-q341 


lncytePD:1712663 


Hs.1281 I 


Cluster #2 " ' 


167153 


gamma-glutamyl hydrolase 
(conjugase, 

folylpolygammaglutamyl hydrolase) 
Protease, Lys 


GGH 
[8q12.1] 


lncytePD:1997967 


Hs.78619 


160605 


P311 protein 

Invasion marker, Adhesion 
Plaques 


P311 
[5q21.3] 


lncytePD:1555545 


Hs. 142827 


169429 


nuclear receptor subfamily 3, 
group C, member 1 
Glucocort. Rec/TF 


NR3C1 
[5q31] 


lncytePD:629077 


Hs.75772 
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Table 2 


Genes 


Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 

Cells 


Unique 
ID No. 


uescripiion 


Irene Symbol 
(TViap) 


Incyte 
Clone ID No. 


UG 
Cluster 


165192 


synaptojanin 2 
IP3 5-Phosphatase 


SYNJ2 
[6q25-261 


lncytePD:3954785 


Hs.61289 


165784 


adducin 3 (gamma) 
Cytoschel 


ADD3 

[10q24.2-q24.3] 


lncytePD:148l225 


Hs.324470 


163031 


KIAA0751 gene product 


KIAA0751 
[8q23.1] 


lncytePD:2369544 


Hs.153610 


166328 


proteasome (prosome, macropain) 
26S sub unit, ATPase, 6 
Proteasome 


PSMC6 
[12q15] 


lncytePD:1 488021 


Hs.79357 


168061 


formyltetrahydrofolate 
dehydrogenase 

NADPH Sx, Folic Acid One-carbon 
meth 


FTHFD 
[3q21.3] 


IncytePD:21 04145 


Hs,9520 


168141 


diacytglycerol kinase, gamma 
(90kD) 


DGKG 
[3q27-q281 


lncytePD:2568547 


Hs.89462 


165076 


PI-3-kinase-related kinase SMG-1 
RNA Survellance 


SMG1 
[16p12.31 


lncytePD:4253663 


Hs.110613 


167103 


TAF2 RNA polymerase II, TATA 
box binding protein (TBP)- 
associated factor, 150 kD 
TATA Box TF 


TAF2 
[8q24.l2] 


lncytePD:998069 


Hs.1 22752 


169391 


eukaryotic translation initiation 
factor 2, subunit 1 (alpha, 35kD ) 
polysome 


EIF2S1 
[I4q23.3] 


lncytePD:1224219 


Hs.151777 


166789 


zinc finger protein 202 
Transcriptional Repressor 


ZNF202 
[11q23.3I 


lncytePD:1 997937 


Hs.9443 


167316 


solute carrier family 24 
(sodium/potassium/calcium 
exchanger), member 1 
Sodium/potassium/calcium 
exchanger 


SLC24A1 
[15q22J 


lncytePD:2200079 


Hs 173092 


168700 


formyl peptide receptor-like 1 
Integram 

Membr/Migration/Expressed in 
Lung 


FPRL1 

[19q13.3-q13.4] 


lncytePD:523635 


Hs.99855 


165576 


interleukin 6 signal transducer 
(gp130, oncostatin M receptor) 


IL6ST 
[5q11] 


lncytePD:21 72334 


Hs.82065 


168276 


integrin, beta-like 1 (with EGF-like 
repeat domains) 


ITGBL1 
[13q33l 


lncytePD:1 258790 


Hs.82582 


169180 


interleukin 8 receptor, beta 


IL8RB 
[2q35] 


lncytePD:561992 


Hs.846 


160957 


protein kinase, AMP-activated, 
alpha 2 catalytic subunit 


PRKAA2 
[1P31] 


lncytePD:2507648 


Hs.2329 


160617 


colony stimulating factor 2 
receptor, beta, low-affinity 
(granulocyte-macrophage) 


CSF2RB 
[22q13.1l 


lncytePD:1561352 


Hs.285401 


160429 


PTK6 protein tyrosine kinase 6 
Non-Receptor, Sensitizes to EGF 


PTK6 
[20q13.3l 


lncytePD:3255437 


Hs.51133 


160237 


nuclear protein, ataxia- 
telangiectasia locus 
Osteogenesis Imperfecta 


NPAT 

[11q22-q23] 


lncytePD:2308525 


Hs.89385 


167125 


tumor necrosis factor receptor 
superfamily, member 6 


TNFRSF6 
[10q24.1) 


lncytePD:2205246 


Hs.82359 


164652 


platelet-derived growth factor 
receptor, beta polypeptide 


PDGFRB 
[5q31-q32l 


lncytePD:1821971 


Hs.76144 


161117 


ATP-binding cassette, sub-family 
G (WHITE), member 2 
Multidrug Resistance 


ABCG2 
[4q22] 


IncytePD:1501080 


Hs.1 94720 


161896 


collagen, type XV, alpha 1 


COL15A1 
[9q21-q22] 


lncytePD:4287342 


Hs.83164 " 
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1 Table 2 


Genes 


Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 

Cells 


Unique 
ID No. 


Description 


Gene Symbol 
(Map) 


Incyte 
Clone ID No. 


UG 
Cluster 


1 KQD-t Q 

loyolo 


protein tyrosine phosphatase, non- 
receptor type 12 
PEST Dom; p-c-Abl, Ctx. Cell 
shape/motility 


PTPN12 
[7q11.23J 


lncytePD:1 382374 


Hs.62 


164573 


cyclin D binding Myb-like 

transcription factor 1 

Not reported to be Expressed in 

Lung 


DMTF1 
[7q21] 


lncytePD:1637517 


Hs 5671 


169384 


solute carrier family 22 (organic 
cation transporter), member 1-like 
antisense 

Organic-Cation Transporter-Like 2- 
Antisense 


SLC22A1LS 
[11p15.5] 


lncytePD:3842669 


Hs.300076 




ESTs, Weakly similar to 2109260A 
B cell growth factor [H.sapiensl 




IncytePD:3202075 


Hs.351699 


168169 


3-oxoacid CoA transferase 
mitochondrial matrix coenzyme A 
from succinyl-CoA to acetoacetate 


OXCT 
[5p13] 


lncytePD:1685342 


Hs. 177584 


165617 


prolactin receptor 


PRLR 
[5p14-p13l 


lncytePD:3427560 


Hs.1906 


169432 


interleukin 13 receptor, alpha 2 


IL13RA2 
[Xq13.1-q28l 


lncytePD:3360476 


Hs.25954 


1 OOOl ^ 


myelin protein zero-like 1 
extracellular membrane face 


MPZL1 
f1 023.21 


lncytePD:2057323 


Hs.287832 




runt-related transcription factor 3 


RUNX3 
[1P36] 


Incyte PD:885297 


H<5 170019 
no. i / uu i » 


167180 


S100 calcium-binding protein A4 
(calcium protein, calvasculin, 
metastasin, murine placental 
homolog) 

cell cycle progression, Associated 
with mets 


S100A4 
[1q21] 


lncvtePD'1222317 


no.O I £3D 


1 101 OOO 


cleavage stimulation factor, 3* pre- 

RNA, subunit 2, 64kD 

RNA processing/modification 


CSTF2 
[Xq21.33] 


lncytePD:4016254 


Hs.693 


165588 


small nuclear RNA activating 
complex, polypeptide 4, 190kD 


SNAPC4 
[9q34.3] 


lncytePD:2224902 


Hs.113265 


164799 


epithelial membrane protein 3 
cell-cell interactions. Promotes 
Apoptosis 


EMP3 
[I9q13.3] 


lncytePD:780992 


Hs.9999 


161709 


hypothetical protein FLJ11560 


FLJ11560 
f9p12l 


IncytePD: 1990361 


Hs.301696 


164868 


guanylate binding protein 2, 

interferon-inducible 

GTP-ase 


GBP2 

[1pter-p13.2] 


incytePD:1610993 


Hs.171862 


1 ouzoo 


dual-specificity tyrosine-(Y)- 
phosphorylation regulated kinase 3 
Cell growth, P-histones, 
Transcription 


DYRK3 
[1q32] 


lncytePD:614679 


Hs.38018 


165400 


hypothetical brain protein my040 j 
Overexp Lung neuroendocrine 
tumors 


MY040 
[7q35-q36] 


lncytePD:2048144 


Hs.1 24854 


165957 


pancreatic lipase-related protein 2 
Hydrolyse 


PNLIPRP2 ! 
[10q26.12] 


lncytePD:885032 


Hs.143113 


160054 


GTP-binding protein homologous 
to Saccharomyces cerevisiae 
SEC4 

Sec vesicles SC 


SEC4L 
[17q25.3] 


IncytePD: 1824556 


Hs.302498 


162475 


cancer/testis antigen 2 
melanomas, non-small-cel! lung 
carcinomas, bladder, Prostate, H/N 


CTAG2 
[Xq28] 


lncytePD:849425 


Hs.87225 
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Table 2 | 


Genes 


Having Statistically Significant Variance in Expression in Neuroendocrine Tumor | 

Cells 


Unique 
ID No. 


Description 


Gene Symbol 
(Map) 


Incyte 
Clone ID No. 


UG 
Cluster 


169182 


testis-specific ankyrin motif 
containing protein 


[7q31l 


lncytePD:201 3272 


Hs.73073 


162912 


nectin 3 

PVRL1; may be a membrane 
glycoprotein 


DKFZP566B084 
[3q13] 


lncytePD:2680168 


Hs.21201 


163475 


hypothetical protein 
7q22.1 102-113 


FU20485 
[7q22.1l 


lncytePD:2299818 


Hs.98806 


164927 


heterogeneous nuclear 

ribonucleoprotein AO 

RNA processing/modification 


HNRPAO 

[5q31] 


incytePD:637639 


Hs. 77492 


160630 


homeo box D9 

RNA processing/modification 


HOXD9 
[2q31-q37l 


lncytePD:2956581 


Hs.236646 


160367 
163762 


v-jun avian sarcoma virus 17 
oncogene homolog 
Associated with transl in Tumors 
ESTs 


JUN 

[1p32-p31] 


IncytePD: 1969563 


Hs.78465 


162247 


very large G protein-coupled 
receptor 1 

transports Ca2+ during excitation- 
contraction 


F171 

VLGR1 

[5q13] 


lncytePD:1 743234 
lncytePD:942207 


Hs.120854 
Hs.1 53692 


167219 
Cluster #3 


pumiiio (Drosophila) homolog 1 


PUM1 
[1p35.2] 


lncytePD:3333130 


Hs.1 53834 


165171 


keratin 18 


KRT18 
[12q13l 


IncytePD: 1435374 


Hs.65114 


165052 


CDC20 (cell division cycle 20, S. 

cerevisiae, homolog) 

Cell cycle, microtubule-dependent 

processes 


CDC20 
[9q13-q21] 


lncytePD:2469592 


Hs.82906 


167948 


pim-1 oncogene 

S.T kinase Hematop Cells 


PIM1 
[6p21.2l 


IncytePD:2679117 


Hs.81170 


161954 


ATPase, H+ transporting, 
lysosomal (vacuolar proton pump) 
21kD 

Vacuolar H Transporter 


ATP6F 
[1p32.3] 


lncytePD:5017148 


Hs.7476 


162391 


polymerase (DNA directed), 
epsilon3(p17subunit) 
DNA Replication 


POLE3 
[9q33] 


lncytePD:961082 


Hs 108112 I 


166635 


keratin 5 (epidermolysis bullosa 
simplex, Dowling- 
Meara/Kobner/Weber-Cockayne 
types) 


KRT5 

[12q12-q13] 


lncytePD:3432534 


Hs. 195850 


160035 


flap structure-specific 

endonuclease 1 

DNA Repair/UV rad protection 


FEN1 
[11q12] 


lncytePD:2050085 


Hs.4756 


161774 


calcium and integrin binding 
protein (DNA-dependent protein 
kinase interacting protein) 


SIP2-28 
[15q25.3-q26] 


lncytePD:4626895 


Hs.10803 


162207 


membrane protein of cholinergic 
synaptic vesicles 
vesicular transport 


VATI 
[17q21] 


lncytePD:2060308 


Hs. 157236 


161163 


guanyiate kinase 1 
Sx GTP/GMP 


GUK1 
[1032-0411 


lncytePD:2506427 


Hs.3764 | 


161223 


CD27-binding (Siva) protein 
tumor necrosis receptor (TFNR) 
superfamily 


SIVA 
(22] 


lncytePD:2356635 


Hs. 112058 


161211 


capping protein (actin filament), 
gelsolin-like 


CAPG 
[2cen-q24] 


JncytePD:2508570 


Hs.82422 


161948 


claudin 1 1 (oligodendrocyte 
transmembrane protein) 


CLDN11 
[3q26.2-q26.31 


lncytePD:41 44001 


Hs.31595~ 


161391 


interieukin 17F 


IL17F 

[6P12] I 


lncytePD:1610083 


Hs.272295 
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Table 2 


Genes 


Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 

Cells 


Unique 
ID No. 


Description 


Gene Symbol 
(Man) 


Incyte 
Clone ID No. 


UG 
Cluster 


162571 


phosphofructokinase, liver 


PFKL 
[21q22,3l 


lncytePD:885601 


He 

no. I OOHOO 


164504 


cathepsln C 
Lys Prot Degr 


CTSC 

[11q14.1-q14.3] 


IncytePD:1822716 


Hs. 10029 


160565 


aminoacylase 1 
L-aa Sx salvage path 


ACY1 
[3p21.11 


lncytePD:1812955 


Hs.334707 | 


169551 


glycogen synthase kinase 3 beta 
target of Akt, I1k1 , Reg jun, myb, 
etc. 


GSK3B 
[3q13.3] 


lncytePD:2057908 


no. i OOU& 


166914 


methyltransferase-Iike 1 
S-adenosylmethionine-bindinq mo 


METTL1 
n2q13l 


IncytePD: 1603584 


Hs.42957 


167738 


cytochrome P450, subfamily 
XXVIIB (25-hydroxyvitamln D-1- 
alpha-hydroxylase), polypeptide 1 
drug metabolism and synthesis of 
cholesterol, steroids 


CYP27B1 
[12q13.1-q13.3] 


lncytePD:1 749727 


Hs. 199270 


160938 


GrpE-like protein cochaperone 
cooperates with mitochondrial 
hsp70 i 


HMGE 
[4p16] 


lncytePD:2074154 


Hs.151903 


162734 


wingless-type MMTV integration 
sjte family, member 7A 
Regulates Steroid responses 


WNT7A 
[3p25] 


lncytePD:2622566 


Hs.72290 


165813 


caspase 4, apoptosis-related 
cysteine protease 


CASP4 

[11q22.2-q22.3l 


lncytePD:2304121 


Hs.74122 


159898 


pituitary tumor-transforming 1 


PTTG1 
[5q35.1l 


IncytePD: 1 748705 




161244 


ADP-ribosylation factor 4-like 
GTP-binding proteins. ARF4L is c 


ARF4L 
[17q12-q21l 


lncvtePD'2852403 


no. 1 oo 1 DO 


160715 


cell division cycle 34 


CDC34 
[19p13.3] 


lncytePD:1 857493 


Hq 7fiq*39 ! 


163787 


pyrro!ine-5-carboxylate reductase 
1 j 
Proline Sx 


PYCR1 
(17q24) 


lncytePD:1 702266 


Hs.79217 


160127 


phosphoglycerate mutase 1 (brain) 


PGAM1 
[10q25.3l 


lncytePD:3032691 


Hs.181013 


160323 


5-aminoimidazole-4-carboxamide 

ribonucleotide 

formyltransferase/IMP 

cyclohydrolase 

Purine BioSx 


ATIC 
I2q35] 


lncytePD:2056149 


Hs.90280 


164850 


interleukin-1 receptor-associated 
kinase 1 


IRAKI 
[Xq28] 


lncytePD:1 872067 


Hs.182018 


165583 


7-dehydrocholesterol reductase 


DHCR7 

Pl1q13.2-q13.5l 


incytePD:3518380 


Hs.11806 


165039 


thymidine kinase 1, soluble 
two forms have been Identified in 
animal cells 


TK1 ^ 
I17q23.2-q25.3] 


lncytePD:2055926 


Hs.105097 ; 


167964 


cydin-dependent kinase inhibitor 
2A (melanoma, p16, inhibits 
CDK4) 


CDKN2A 
[9p21 


lncytePD:2740235 


Hs.1174 


167223 


guanine nucleotide binding protein 
(G protein), beta polypeptide 1 
Ras GTPase, Contains 7 wd 
repeats 


GNB1 

[1p36.21 -36.33] j 


lncytePD:3562795 


Hs. 21 5595 


167931 


cleavage stimulation factor, 3' pre- 
RNA, subunit 1, 50kD 
RNA, transducin-like repeats 


CSTF1 
[20q13.2] 


lncytePD:1635008 


Hs. 172865 


163690 


hexabrachion (tenascin C, 
cytotactin) 


HXB 
[9q33] 


lncytePD:1 453450 


Hs.289114 


161955 


contactin 2 (axonal) 


CNTN2 
[1Q32.11 


lncytePD:4014715 


Hs.2998 
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1 Table 2 


{ Genes 


Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 


Unique 
ID No. 


npcorintinn 
Ase&CripilOZl 


Gene Symbol 
(Map) 


Incyte 
Clone ID No. 


UG 
Cluster 


1 60275 


structure specific recognition 
protein 1 


SSRP1 
[11q12 


lncytePD:2055773 


Hs 79162 


168110 


TAF12 RNA polymerase II, TATA 
box binding protein (TBP)- 
associated factor 20 kD 


TAF12 
[1p35.1] 


IncytePD: 1297269 


Hs.82037 


160102 


protein disulfide isomerase related 
protein (calcium-binding protein, 
intestinal-related) 
Sevretion; ER 


ERP70 
[10] 


IncytePD: 1824957 


Hs 93659 


167116 


nucleoside phosphorylase 
adenosine deaminase (ADA) 
serves a key role in purine 
catabolism; Def=SCJD 


NP 

[14q13.1] 


lncytePD:2453436 


Hs.75514 


160802 


prohibitin 

Tumor suppressor, Blocks DNA 
Sx; Breast CA 


PHB 
[17q21J 


IncytePD:1625169 


Hs.75323 


161643 


ADP-ribosylation factor-like 7 
GTP-binding protein 


ARL7 
T2q37.2l 


lncytePD:31 15514 


Hs.1 11554 


162343 


LIM domain kinase 2 
Rho-induced reorganization of the 
actin cytoskeleton 


LIMK2 
[22q12.2] 


lncytePD:958513 


Hs.278027 


162727 


protein tyrosine kinase 9-Iike (A6- 
related protein) 


PTK9L 
[3p21.1] 


lncytePD:3999291 


Hs.6780 


160262 


DEAD/H (Asp-Glu-Ala-Asp/His) 

box polypeptide 28 

probable atp-bindlng rna helicase 


DDX28 
[16q22.1] 


lncytePD:2663948 


Hs.1 55049 


165790 


surfeit 1 

Mit. Resp Enz 


SURF1 
[9q33-q34] 


lncytePD:1921567 


Hs.3196 


168638 


histone deacetylase 7A 


HDAC7A 
[12q13.1] 


IncytePDH 968721 


Hs.275438 


168079 


epithelial membrane protein 1 
cell-cell interactions. Promotes 
Apoptosis 


EMP1 
[12p12.3] 


lncytePD:1 624024 ' 


Hs.79368 


160999 


Rho-specific guanine nucleotide 

exchange factor p1 14 

cell growth and motility; Dbl, PH 

dom 


P114-RHOGEF 
[19p13.3] 


incytePD:1734113 


Hs.6150 I 


161790 


KIAA0469 gene product 


KIAA0469 
[1p36.231 


lncytePD:2674277 


Hs.7764 


169691 


ubiquitin carrier protein 
E2 enzyme activity 


E2-EPF 
[17p12-p1 11 


IncytePD:2057823 


Hs.1 74070 


163682 


diptheria toxin resistance protein 
required for diphthamide 
biosynthesis (Saccharomyces)-tike 
2 


DPH2L2 
[1p34] 


lncytePD:1810821 


Hs.324830 


168266 


proteasome (prosome, macropain) 
activator subunit 3 (PA28 gamma; 
Ki) 


PSME3 
[17q12-q21] 


lncytePD:1308112 


Hs.1 52978 


161374 


polymerase (DNA-directed), alpha 
(70kD) 

RNA Processing 


POLA2 
[11q13.1] 


lncytePD:3179113 


Hs.81942 


164646 


galactose-4-epimerase, UDP- 
Rate-Iim for Sx glycoproteins and 
glycolipids 


GALE 
[1p36-p35] 


lncytePD:1 807294 


Hs.76057 


162150 


apolipoprotein L 


APOL1 
[22q13.1l 


lncytePD:2056987 


Hs.1 14309 


164206 


type I transmembrane protein Fn14 
similar to murine Fqfrp2 


FN14 
[16P13.31 


lncytePD:1402615 


Hs.1 0086 


162623 


BCL2-antagonist/kilier 1 


BAK1 
[6p21.3l 


lncytePD:2055687 


Hs.93213 


162244 


Rho GDP dissociation inhibitor 
(GDI) alpha 


ARHGDIA 
(17q25.3] 


lncytePD:2055640 


Hs.1 59161 
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Table 2 


J Genes 


Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 

Cells 


Unique 
ID No. 

164586 


Description 


Gene Symbol 
(Map> 


Incyte 
Clone ID No. 


UG 
Cluster 


inosine triphosphatase (nucleoside 
triphosphate pyrophosphatase) 
Ins Phos phosphatase 


ITPA 
[20pJ 


lncytePD:1931265 


Hs.6817 


165483 


PDGFA associated protein 1 
Enhances PDGFA 


PDAP1 
[7Q22.1] 


lncytePD:3032825 


Hs.278426 


166195 


adenine phosphoribosyltransferase 
Sx AMP purine/pyrimidine Met 


APRT 
n6q24l 


lncytePD:2751387 


Hs.28914 


166960 


Apg12 (autophagy 12, S. 
cerevisiae)-like 


APG12L 
[5q21-q22l 


lncytePD:2058537 


Hs.264482 


167505 


thiosulfate sulfurtransferase 

(rhodanese) 

Mltoch detox cyanide 


TST 

[22q13.1] 


lncytePD:1 988239 


Hs.351863 


168642 


suppression of tumorigenicity 14 
(colon carcinoma, matriptase, 
epithin) 
Protease ECM 


ST14 

[11q24-q25I 


lncytePD:478960 


Hs.56937 


167170 


GS2 gene 


DXS1283E 
[Xp22.31 


incyte PD: 1567995 


Hs.264 


161754 


actin, gamma 2, smooth muscle, 
enteric 


ACTG2 
[2p13.1l 


lncytePD:3381870 


Hs.78045 


166010 


receptor (TNFRSF)-interacting 
serine-threonine kinase 1 


RIPK1 
[6p25.3] 


lncytePD:21 80031 


Hs.296327 


161794 


secretory carrier membrane protein 
2 

Vesic Traff, Secretpry path 


SCAMP2 
[15q23-q25] 


lncytePD:31 23858 


Hs.238030 


167591 


catechol-O-methyltransferase 
Sx dopamine, epinephrine, and 
norepinephrine 


COMT 
[22q11.21] 


incytePD:605019 


Hs.240013 


1 62587 


polymerase (RNA) li (DNA 
directed) polypeptide D 
RNA Processing 


POLR2D 
[2q21] 


lncytePD:696002 


Hs. 194638 


169071 


capping protein (actin filament) 
muscle Z-Iine, beta 


CAPZB 
[1p36.1] 


lncytePD:1853163 


Hs.333417 


160467 


polymerase (DNA directed), delta 
2, regulatory subunit (50kD) 
RNA Processing 


POLD2 
[7p13] 


lncytePD:2056172 


Hs.74598 


162178 


C2f protein 


C2F 
[12p13l 


lncytePD:5096975 


Hs.12045 J 


1 67706 


GDP-mannose pyrophosphorylase 
B 

N-linked oligosaccharides 


GMPPB 
[3p21.31] 


lncytePD:1486983 


Hs.28077 


160803 


phenylalanine-tRNA synthetase- 
like 

Reg. in tumors and cell cycle 


FARSL 
[19p13.2] 


lncytePD:1 808260 


Hs.23111 


169254 


polymerase (DNA directed), mu 
RNA Processing 


POLM 
[7 P 13] 


lncytePD:771715 


Hs.46964 


167351 


myosin-binding protein H 


MYBPH 
[1q32.1l 


incytePD:3010959 


Hs.927 


163276 


ESTs, Weakly similar to I37356 
epithelial microtubule-associated 
protein, 115K [H.sapiensJ 


[7] 


lncytePD:2383065 


Hs.25892 


167135 


excision repair cross- 
complementing rodent repair 
deficiency, complementation group 
1 (includes overlapping antisense 
sequence) 


ERCC1 

[19q13.2-q13.3] 


lncytePD:2054529 | 


Hs.59544 


160478 


G5b protein 


G5B 
[6p21.3] 


IncytePD: 1942845 ; 


Hs.73527 


162631 


transcriptional adaptor 3 (ADA3, 
yeast homolog)-Iike (PCAF histone 
acetylase complex) 
PCAF histone acetilase complex 


TADA3L 
[3p25.2] 


lncytePD;3990209 


Hs.158196 
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Table 2 1 


Genes 


Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 

Cells 


Unique 
ID No. 


Description 


Gene Symbol 
(Map) 


Incyte 
Clone ID No. 


UG 
Cluster 


163921 
D 160098 


glucosamfne-6-phosphate 

isomerase 

Hydrolase 


GNPI 
[5q21] 


lncytePD:1 653911 


Hs 278500 


y i www/ 


mitochondrial ribosomal protein 
L49 


MRPL49 
[11Q13] 


Incyte PD: 1755793 


Hs.75859 


161058 


multiple endocrine neoplasia I 


MEN1 
[11q13] 


lncytePD:1693847 


Hs.24297 


160038 


BCL2-antagonist of cell death 


BAD 

[11q13.1] 


lncytePD:3967780 


Hs.76366 


162220 


FK506-bindlng protein 1A (12kD) 
Interacts with TGF beta 


FKBP1A 
T20p13] 


lncytePD:4059193 


Hs.349972 


161026 


Xq28, 2000bp sequence contg. 
ORF 

3* eDNA Repair xonuclease activity 


HSXQ280RF 
[Xq28] 


IncytePD: 1669254 


Hs.6487 


167607 


heat shock protein 75 
HSP90 fam, Binds to TNFR 


TRAP1 
[16p13.3l 


IncytePD: 1 960722 


no. i ouu I 


167713 


likely ortholog of maternal 
embryonic leucine zipper kinase 
regulation of fatty acid synthesis 


KIAA0175 
[9p11.2] 


lncytePD:3805046 


Hs 18433Q 


1 65648 


dual specificity phosphatase 4 
negatively regulate MAPK. Anti- 
oncogene 


DUSP4 
[8p12-p11] 


lncytePD:740878 


Hs i 


161574 


frequently rearranged in advanced 
T-cell lymphomas 2 
prevent gsk-3-dependent 
phosphorylation 


FRAT2 

[10q23-q24.1] 


lncytePD:3871545 


Hs.140720 


161650 


KIAA0415 gene product 


KIAA0415 
[7p22.2] 


lncytePD:2798872 


Hs.229950 


1 UOOOO 


nucleolar and coiled-body 
phosphprotein 1 


NOLC1 
[10] 


lncytePD:1431819 


Hs.75337 J 


159906 


H2A histone family, member X 


H2AFX 

[11q23.2-q23.3] 


I ncvte PD* 1 704 1 fifi 


Lie 1A7nQ7 


167906 


RAE1 (RNA export 1, S.pombe) 
homolog 

RNA export from the N 


RAE1 
[20q13.31] 


lncvtePD*291 471 9 


no. I oO^Uo 


I D u*N30 


deltex (Drosophila) homolog 2 
collagen type Hi 


DTX2 
[7q11.23] 


IncytePD: 1691 161 


Hs.89135 


I QUO / O 


v-maf musculoaponeurotic 
fibrosarcoma (avian) oncogene 
family, protein G 
transcriptional regulator 


MAFG 
[17q25] 


!ncytePD:2956906 


Hs.252229 


159889 


fusion, derived from t(12;16) 
malignant liposarcoma 
DMA Sx atp-independent 
annealing of complementary 

Sinolp- ^tranrlpH Hnae 


FUS 

[16p11.2] 


lncytePD:3038508 


Hs.99969 


167553 


ligase I, DNA, ATP-dependent 
DNA excision repair process 


LIG1 

[19q13.2-q13.3l 


lncytePD:1841920 


Hs.1770 


163824 


uracil-DNA glycosylase 
DNA Base-excision repair 


UNG 

[12q23-q24.1l 


IncytePD: 1405652 


Hs.78853 


161012 


GCN1 (general control of amino- 
acid synthesis 1 , yeastHike 1 


GCN1L1 
[12q24.21 


IncytePD: 16991 49 


Hs.75354 


162006 


regenerating isiet-derived 1 beta 
(pancreatic stone protein, 
pancreatic thread protein) 
brain and pancreas regeneration 


REG1B 
[2p12] 


lncytePD:2374294 


Hs,4158 


161454 


serine protease inhibitor, Kunitz 
type 1 

Secreted S/Protease; proteolytic 
activation of HGF 


SPINT1 
[15q13.3] 


lncytePD:2722572 


Hs.23395(T~ 
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Table 2 



Genes Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 
, Cells 



I Unique 
1 ID No. 

162510 


Description 


Gene Symbol 
\iviapj 


Incyte 
Clone ID No. 


UG 
Cluster 


calcium/calmodulin-dependent 
protein kinase kinase 2, beta 

Q/T* Prntoin Minora 

o/ 1 rioicin Kinase 


CAMKK2 
[12] 


IncytePD:557451 


Hs.1 08708 


163306 


Bloom syndrome 
DNA Repair 


BLM 

[15q26.1] 


lncytePD:2923082 


Hs.36820 




rviMM, u iransporter i 


RNUT1 


lncytePD:1 562658 


Hs.21577 . 


164106 


glutamate rich WD repeat protein 

VJlNV V LJ 

RNA stability 


GRWD 
[19q13.33] 


lncytePD:1561867 


Hs.218842 


165799 


ivtr\u ^moiners against 
decapentaplegic, Drosophila) 
homolog 3 

TF, activated by tgf-beta 


MADH3 
[15q21-q22] 


IncytePD: 1858365 


Hs.211578 


166574 


small nuclear RNA activating 
complex, polypeptide 4oKU 
RNA Processing 


SNAPC2 
[19p13.3-p13.2] 


lncytePD:1445203 


Hs.78403 


160441 


lymphotoxin beta receptor (TNFR 
supenamiiy, memoer 3) 
TNF family of receptors 


LTBR 
[12p13] 


lncytePD:899102 


Hs.1116 




transforming, acidic coiled -coil 
containing protein 3 
Upregulated in Tumors 


TACC3 
[4p16.3] 


lncytePD:2056642 


Hs.104019 




proteasome (prosome, macropain) 
26S subunit, ATPase, 4 


PSMC4 

[19q13.11-q13.13l 


lncytePD:2806778 


Hs.211594 


169564 


SWI/SNF related, matrix 
associatea, actin Dependent 
regulator of chromatin, subfamily d, 

TF 


SMARCD2 
[17q23-q24] 


lncytePD:1890919 


Hs.250581 


161178 


udoiyiii \\jf\ uioou groupj 
Induces MMTP; p-regulated in 
gliomas 


BSG 

[19p13.3] 


I ncytePD:2 182907 


Hs.74631 


165614 


junction plakoglobin 


JUP 
[I7q21l 


lncytePD:820580 


Hs.2340 


168987 


HMT1 (hnRNP methyltransferase, 
Protein methylation 


HRMT1L2 
[jyqi J.3J 


lncytePD:2888814 


Hs.20521 


167987 


ectonucleoside triphosphate 

dinhn^nhnhx/rtrrtlaco 1 

ATP hydrolysis, Pit aggregation 


ENTPD1 
[i uq^4j 


lncytePD:1 672749 


Hs.205353 


1 63726 


uui 1 1 pit!j 1 1 ci i l curnponsni o 


C3 

[19p13.3-p13.2l 


IncytePD: 151 3989 


Hs.284394 


164642 


tyrosyl-tRNA synthetase 


YARS 
[1p34.3| 


IncytePD: 1559756 


Hs.239307 


| 160303 


Ets2 repressor factor 


ERF 
[19q13l 


lncytePD:2057547 


Hs.333069 


161635 


G protein-coupled receptor 


TYMSTR 
[3p21] 


lncytePD:2610374 


Hs.34526 


159859 


nuclear autoantigen 
wd REPEAT PROTEIN 


GS2NA 
[14q13-q21] 


lncytePD:1339241 


Hs.183105 


161051 


MAP/microtubule affinity-regulating 
kinase 3 

S/T Protein kinase 


MARK3 
[14q32.3J 


lncytePD:2395018 


Hs.1 72766 


161835 


peroxisome biogenesis factor 1 0 


PEX10 

[1p36.11-1p36.33] 


lncytePD:31 15936 


Hs.247220 


165571 


annexin A3 

calcium-dependent phosphotipid- 
bfndlng 


ANXA3 
[4q13-q223 


lncytePD:1 920650 


Hs.1 378 


164286 


nuclear factor of kappa light 
polypeptide gene enhancer in B- 
cells inhibitor, epsilon 


NFKBIE 
[6p21.1] 


lncytePD:2748942 


Hs.91640 
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Table 2 



Genes Having Statistically Significant Variance in Expression in Neuroendocrine Tumor 
. Cells 



Unique 
ID No. 


Description 


Gene Symbol 
(Mao) 


Incyte 
Clone ID No. 


UG 
Cluster 


1 65786 


hx/ali iron And i/>ncamlnir4qpa O 

Degrades glycosaminoglycans of 
the extracellular maMy 


HYAL2 
[3p21.3] 


lncytePD:1240748 


Hs.76873 


161620 


H4 histone family, member E 


H4FE 

[op^-p^l .3] 


IncytePD:3728255 


Hs.278483 


168302 


Tax interaction protein 1 
1 pdz/dhr domain 


TIP-1 
[17P13] 


lncytePD:1997792 


Hs.12956 


160887 


pescadillo (zebrafish) homolog 1, 
wi nan hi iy oiau i uomain 
embrional dev 


PES1 
I22q12.1] 


lncytePD:2758740 


Hs.13501 


162419 


RAR1 /RMA avnnrt -1 C n/\mka\ 

rvnt i ^rviv\ export i , o.pomDej 
homolog 


RAE1 
[20q13.31J 


lncytePD:588157 


Hs.1 96209 


169625 


replication factor C (activator 1) 4 
(37kD) 

DNA Sx/Repair 


RFC4 
[3q27] 


)ncytePD:1 773638 


Hs.35120 


163425 


transcription elongation factor A 
(Sll), 2 


TCEA2 
[20] 


lncytePD:818568 


Hs.80598 


166359 


tubulin, beta polypeptide 
Testls-specific 


TUBB 
[6p21.3] 


lncytePD:3334367 


Hs.336780 


161947 - 


translocase of inner mitochondrial 
membrane 17 homolog B (yeast) 
Integral Mitoch. Expr. In 
Neuroendocr Lung CA 


T1M17B 
[Xp11.23] 


lncytePD:1 727491 


Hs.19105 


162236 


KIAA0670 protein/acinus 


KIAA0670 
[14q11.1l 


lncytePD:1 968610 


Hs.227133 


168426 


glioma pathogenesis-related 
protein 


RTVP1 
[12Q151 


lncytePD:477045 


Hs.64639 



Characteristics Of The Gene Expression Patterns In Pulmonary 
Neuroendocrine Tumors. The present invention permits investigation of whether 
expression of genes significantly altered in neuroendocrine tumors correlates with 
clinical behavior of these tumors. The results show that most of 198 selected genes 
could be assigned to major functional groups that have been previously implicated 
in cancer development (Table 3). In particular, decreased expression of genes that 
oppose cell survival pathway, such as BCL2 antagonist-killer, BAK1, and caspase 
4, are common in all 3 types of neuroendocrine tumors, whereas TC and LCNEC 
have an additional >2. 5-fold decrease in expression of BAD and TNF receptor- 
interacting kinase, RIPK1 . These features indicate that these tumors lack opposing 
effects on BCL2, as contrasted to overexpression of BCL2, which leads to survival 
advantage in certain types of lymphomas (Cleary, M.L. et al (1986) "Cloning 
And Structural Analysis Of Cdnas For Bcl-2 And A Hybrid Bcl- 

2/lMMUNOGLOBULIN TRANSCRIPT RESULTING FROM THE T(l 4; 1 8) 

Translocation," Cell. 47(l):19-28). 
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Genes involved in regulation of cell-cell and extracellular matrix 
interactions, claudin 1 1 (CLDN1 1), contractin-2, (CNTN2), keratin 5 and 18 (KRT 
5 and 18), calcium and integrin binding protein (SIP2-28), and junction 
plakoglobulin (JUP) are also suppressed in TC and LCNEC tumors, and, to a lesser 
5 degree, in SCLC. The dominant group of genes is involved in transcriptional 
regulation and DNA synthesis and repair. Decrease in expression of Bloom 
(BLM) is shared by TC and LCNEC, whereas DNA excision repair (ERCC1) and 
DNA ligase-1 (LIG) are suppressed in all tumor types. Other groups of genes 
manifesting decreased expression in all tumors are genes involved in cell cycle 

10 control (CDC34, pl6/CDK inhibitor 2A), suppressor of MAPK pathway (dual 
specificity phosphatase, DUSP4), antioncogenes, such as epithin (ST14), and 
prohibitin, (PHB). Decreased expression of genes involved in microtubular 
assembly, beta tubulin polypepetide B (TUBB) in conjunction with overexpression 
of ATP-binding cassette protein (ABCG2) and gamma glutamyl hydrolase (GGH), 

1 5 could confer well-known resistance of these tumors to chemotherapy, specifically 
to taxol-related drugs. Decreased expression of genes associated with the ubiquitin 
pathway, such as proteasome subunit 26S (PSMC4), and proteasome activator 
subunit 3 (PSME3), correlates with potential resistance to newly developed 
proteasome inhibitors. The decrease in expression of these genes can affect NFkB 

20 activity, drug resistance and other functions in these tumors. 

Only a fraction of genes identified herein is significantly over-expressed. 
Expression of a neuroendocrine peptide processing enzyme, carboxypeptidase E 
(CPE), inotropic glutamate receptor (GPJA2) and a complement component 5 are 
increased 4-6-fold in TC. In addition, TC has a modest increase in expression of 

25 the IL8 receptor B, IL8RB (1 .61 -fold), and that of the interleukin 6 signal 

transducer chain common to several interleukin receptors, gpl30 (Oncostatin M, 
IL6ST), which is elevated at a mean of 1 .34-fold in the 1 1 samples from TC. In 
contrast, LCNEC, have over 20 genes whose expression is above 1.9-fold or higher 
(Figures 3A and 3B). These gene products are increased specifically in LCNEC 

30 and included colony stimulating factor receptor (CSF2R), IL 13 receptor 

(IL13RA2), IL-8 receptor beta (IL8RB) as well as the IL 6 signal transducer, 
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gpl30 (Oncostatin M, IL6ST) and gamma-glutamyl hydrolase (GGH), which has 
been associated with drug resistance. In addition, LCNEC have a six-fold over- 
expression of a neuronal marker, P3 1 1, recently identified as a marker of 
aggressive gliomas. P31 1 may have a role in defining a metastatic/invasive 
5 potential in LCNEC. In contrast to LCNEC, analysis of SCLC shows only modest 
increase in 25 genes, none of which exceeded 1.5-fold increase. The lack of 
detection of over-expressed genes in SCLC reported herein could reflect a 
qualitative change in oncogenic mutations, such as p21 ras , p53 and others which are 
found in SCLC (Wistuba, LI. et al. (2001) "Molecular Genetics Of Small 
10 Cell Lung Carcinoma," Semin. Oncol. 28: 3-13) or due to limited number of 
samples used. 





Table 3 


Unique ID No. of 


Expression of Genes in Large Cell (LC), Small Cell 


Gene 




(SC) and Typical Carcinoma (TO Cells 




(LOH) 


LC 


I SC 


TC 


Apoptosis 






167125 


Yes 


3.23 


0.88 


1.36 


| 162623 


Yes 


0.23 


0.51 


0.13 


160038 


Yes 


0.47 


1.04 


0.32 


165813 


0.59 


0.75 


0.28 


168079 


0.46 


0.93 


0.25 


164799 


Yes 


1.2 


0.73 


0.64 I 


160441 


0.37 


0.49 


0.18 


161223 


0.2 


0.71 


0.11 


166010 


0.45 


0.99 


0.28 


167607 


0.4 


0.81 


0.23 


166960 


0.17 


0.37 


0.09 


Cell-Cell And ECM Interact 


ions 




168700 


Yes 


1.91 


0.82 


1.69 


168276 


1.61 


0.63 


1.21 


162912 


0.82 


0.7 


1.27 j 


161896 


2.12 


0.75 


1.04 


159813 


1.99 


0.83 


1.22 


166812 


0.93 


0.78 


0.78 


165171 


0.3 


0.16 


0.05 


166635 


0.18 


0.63 


0.11 


161774 


Yes 


0.2 


0.57 


0.11 


161211 


0.27 


0.64 


0.12 I 


| 161948 


0.19 


0.56 


0.09 


162734 


0.73 


1.01 


0.32 


163690 


0.42 


0.82 


0.23 


161955 


0.17 j 


0.38 


0.09 i 


164206 


0.26 


0.53 


0.11 


| 168642 


0.55 


0.96 


0.3 


| 160486 


0.37 


0.72 


0.19 
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Table 3 1 


Unique ED No. of 
Gene 


Expression of Genes in Large Cell (LC), Small Cell 

(SC) and Typical Carcinoma (TC\ rvn* 1 


Uene Family 


(LOH) 


LC 


SC 


TC ! 


161178 


Yes 


0.52 


1.05 




165614 


Yes 


0.32 


0.82 


u.oo 


167987 


Yes 


0.58 


1.03 




165786 


0.56 


0.94 


u.oo 


164504 








DNA Synthesis and Repair 






163306 


0.57 


0.98 


u.ou 


167135 


Yes 


] 0.34 


0.63 


0.2 


160035 


0.21 


0.72 


2lU 


160262 


0.19 


0.58 


0.12 


I 161026 


0.54 


0.78 


0.28 


159889 


0.33 


0.79 


0 22 


1 67553 


Yes 


0.34 


0.67 


0.23 


163824 


0.39 


0.79 


0.24 


169625 


0.98 


0.88 


0.44 


Cell Cycle 






167964 


0.15 


0.33 


0.08 


160715 Yes 


0.33 


0.94 


0.17 


167180 


1.54 


1.37 


1.17 


165052 


0.18 


0.6 


0.08 


162391 


0.17 


0.6 


0.11 


162631 


0.43 


1.06 


0.38 | 


168638 


0.21 


0.58 


0.14 | 


Anti-Oneogenes 




161058 


Yes 


0.72 


1.25 


0.39 


f 165648 


0.31 


0.6 


0.19 


169551 
160802 
161574 
Oncogenes 
160429 
J 167948 


Yes 
Yes 


0.47 
0.16 
0.6 

2.54 
0.61 


0.8 

0.44 

1.05 

0.71 
1.16 


0.26 
0.09 

r 0.4 

0L94 

0.28 i 


loyuatt Yes 
165799 Yes 
Cytoskeleton/Migration 
160999 Yes 
161754 

169071 Yes 
| 167351 


0.28 
0.53 

0.42 
0.53 
0.3 
0.39 


0.42 
0.67 

0.91 
1.11 
0.72 
0.69 


n no 5 

U.Uo 

0.27 

0.24 
0.35 
0.21 
0.26 


152343 
162727 


Yes 


0.33 
0.2 


0.67 
0.45 


0.17 ; 
0.11 


165784 


Yes 


1.46 


0.69 


1.96 | 


160605 


5.94 


0.84 


1.06 


Proteasorae 






f 166328 


1.14 


0.72 


2,12 


169691 Yes 


0.15 . 


0.34 


0.09 


| 168266 


Yes 


0.2 


0.45 


0.1 


164244 


Yes ! 


0.43 


0.67 


0.22 


Drug Resistance 






161117 


2.52 


0.75 


1.12 


167738 


0.32 


0.64 


0.18 i 


167505 


0.39 


0.77 


0.21 | 


166359 


Yes 


0.46 


0.64 


0.28 
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Table 3 


Unique ID No. of 
Gene 


Expression ot Genes in Large Cell (LC), Small Cell 
(SC) and Typical Carcinoma (TP) r>ii s 


oene Family 


(LOH) 


LC 


SC 


TC I 


167153 


6.27 


1 


1 .OA 


168061 


1.32 


0.64 


1.23 


Growth Factors/Receptors i 


\nd Signal Transduction Enzymes 




165576 


1.93 


0.66 


•1 1A 

1 .o4 


169180 


1.88 


0.86 


1 AH 

I .Oi | 


160617 


3.57 


0.86 


n qi 

u.yo j 


164652 


2.63 


0.97 


1.18 


165617 


r 2.9 


0.73 


1.32 


I 169432 


2.04 


0.65 


1 f\A 


161391 


0.43 


I 0.83 


n on 


164850 


0.2 


0.45 


n no 
u.uy 


165483 


0.33 


l_ 0.98 


n 01 


162006 


0.29 


0.71 


A O 

u.z 


161454 


0.58 


0.99 


n oq 


168453 


0.35 


0.59 


n <io 
U. IO 


162220 


L 0.34 


0.76 


n on 


160233 


2.07 


0.97 


1 HO 
I . IO 


Neuronal Markers 






166807 








109B77 
162207 


Yes 


1.39 
0.17 


_ 0.93 


5.89 


J 161948 


0.19 


0.58 
0.56 


0.13 
n no 

l u.uy 


| 159898 


Yes 


0.28 


0.42 


n no 


160127 Yes 


0.14 


0.44 


n *i 
U.I 


I 161955 
I 167591 


0.17 


0.38 


n no 
u.uy 




0.18 


0.46 


n -aa 


162006 


0.29 


0.71 


n o 1 


| 160887 


0.89 


1.4 


056 


162247 








165400 i 


1.7 


0.76 


n do ■ 
U.Oid 


RNA Synthesis, Processing a 
161598 


nd Transcription Factors 
0.82 






169429 
165076 


4.52 
0.96 


0.96 
0.8 


0 SZQ 

1 .To 


167103 


1.7 


0.81 
0.72 


1 ni 

1 *\A 
I .04 


169391 


Yes 


0.98 


0.66 


I. ID || 


I 166789 


Yes 


1.76 


0.75 


1-07 I 


I 168428 


Yes 






1 


165588 


1.11 


0.8 


0.57 


164927 


0.51 


1.65 


1.4 


160630 
160367 


Yes 


0.53 
0.58 


1.15 


1.35 


167931 


0.38 


1.26 
0.99 


0.92 
0.35 


161533 


1.59 


0.67 


0.48 


! 168110 


Yes 


0.35 


0.8 


0.21 I 


161374 


Yes 


0.34 


0.89 


0.19 j 


162587 


0.28 


0.63 


0.17 


160467 


Yes 


0.17 


0.44 


0.12 


160803 


Yes 


0.3 


0.71 


0.18 


169254 
160678 


Yes 


0.29 
0.48 


0.6 


0.16 










0.29 
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Table 3 


Unique ID No. of 
Gene 


Expression of Genes in Large Cell (LC), Small Cell 
(SC) and Typical Carcinoma (TC) r>i|s 


v*ene Family 


(LOH) 


LC 


SC 


TC 


160242 
164106 


Yes 


0.59 
0.48 


0.83 


0.31 


166574 


Yes 


0.47 


0.61 
0.89 


0.24 

0.25 3 


169564 


0.25 


0.48 


0.15 J 


164642 


0.69 


°-92 


0.27 I 


162419 


0.59 


1.03 


0.44 J 


163425 


0.95 


0.86 


0.44 


160303 


Yes 


0.62 


1.45 


0.46 


164573 


Yes 


2.23 


0.82 


1.37 



Molecular Signature Of The Subtypes Of Pulmonary Neuroendocrine 
Tumors. The expression profile of genes significantly altered in neuroendocrine 
tumors was examined to determine whether such information could be used to 
differentiate among each subtype of pulmonary neuroendocrine tumors. To 
5 establish a signature list for each tumor type, the relative expression ratio between 
TC, LCNEC and SCLC is employed. Table 4 shows the extent of expression of 
such a signature list, and provides the ratio of expression. In Table 4, TC/SC 
denotes genes exhibiting higher levels of expression in TC cells than in SC cells; 
SC/TC denotes genes exhibiting higher levels of expression in SC cells than in TC 

1 0 cells. Data for TC/LC, LC/TC, SC/LC, and LC/SC are similarly provided. This 
form of statistical analysis is independent of the reference value and, therefore, can 
be used for future studies. Using a ratio of 1.9 or higher, it is found that TC had 15 
genes whose expression distinguished these tumors from SCLC, and 12 from 
LCNEC. In contrast, 134 genes are higher in SCLC than in TC and 97 higher than 

15 in LCNEC (Table 4). The difference between expression of genes in LCNEC 
from SCLC is encompassed within 34 genes. Thus, cDNA microarray analysis 
derived expression profile obtained using a cell line as a reference can be used to 
develop a molecular signature algorithm which may be useful for differential 
diagnosis of these tumors. 
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Table 4 

[olecular Signature of Neuroendocrine Tumors 




Unique ID 
No. of Gene 


Observed Expression 


Ratio 


Observed 
Expression 






TC / SC 






159877 
167158 
166807 
163031 
166328 
165784 
161598 

1t)OO90 


TC 

C Oft 

o.oy 
6.52 
4.46 
3.15 
2.12 
1.96 
2.59 
1.98 


SC 

0.93 
1.16 
0.81 

j 1.02 
0.72 
0.69 
0.96 


TC/SC 

6.33 
5.62 
5.51 
3.09 
2.94 

f 2.84 
2.70 


Normal Cells 

1.06 


■ 


168700 
165192 
165576 
168061 
168276 
165076 
169180 


1.69 
1.56 
! 1.34 
1.23 
1.21 
1.53 
1.61 

SC/TC 


0.96 
0.82 

! 0.76 
0.66 
0.64 
0.63 
0.81 
0.86 


2.10 
2.06 

I 2.05 
2.03 
1.92 
1.92 
1.89 
1.87 






165052 

161163 

160035 

161223 

161948 

166635 

165583 

160715 

162391 

161244 

161211 
( 161774 

166195 
164850 
160802 
161643 
160262 
164206 
164586 
165039 
161374 
159898 
160102 
164646 
163787 
168266 
161790 
162207 
160127 

160323 r 

165483 

161955 

167948 

168638 

167964 


SC 

0.60 

0.53 

0.72 

0.71 

0.56 

0.63 

0.28 

0.94 

0.60 

0.38 

0.64 

0.57 

0.56 

0.45 

0.44 

1.16 

0.58 

0.53 

0.48 

0.19 

0.69 

0.42 

1.07 

0.69 

0.81 

0.45 

0.45 

0.58 

0.44 

0.43 

0.98 

0.38 

1.16 

0.58 

0.33 


TC 

0.08 

0.08 

0.11 

0.11 

0.09 

0.11 

0.05 

0.17 

0.11 

0.07 

0.12 

0.11 

0.11 

0.09 

0.09 

0.24 

0.12 

0.11 

0.10 

0.04 

0.19 

0.09 

0.23 

0.15 

0.18 

0.10 

0.10 

0.13 

0.10 

0.10 

0.23 

0.09 

0.28 

0.14 

0.08 | 


SC/TC 

7.50 
6.63 
6.55 
6.45 
6.22 
5.73 
5.60 
5.53 
5.45 
5.43 
5.33 
5.18 
5.09 
5.00 

4.89 

4.83 

4.83 

4.82 

4.80 

4.75 

4.68 

4.67 

4.65 

4.60 

4.50 

4.50 

4.50 

4.46 

4.40 

4.30 

4.26 

4.22 

4.14 

4.14 

4.13 


Normal Cells 

0.50 
0.40 
0.50 
0.40 
0.22 
0.40 
0.20 
0.67 
0.35 
0.20 
0.35 
0.40 
0.30 
0.38 

0.80 

0.40 
0.35 
0.10 
0.55 
0.26 

0.42 
0.50 

0.55 
0.40 
0.30 
0.73 

1.86 

0.23 | 
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Unique ID 
No. of Gene 

166960 



Table 4 

Molecular Signature of Neuroendocrine Tumors 



Observed Expression 



Ratio 



Observed 
Expression 



165614 



162727 



167116 



160803 



162343 



163682 



162623 



166914 



168110 



160999 



160486 



160275 



169691 



165790 



169254 
168079 



162587 



0.82 



0.45 



0.32 



0.71 



0.67 



0.59 



0.51 



0.61 



0.80 



0.91 



0.72 



0.53 



0.34 



0.45 



0.60 



0.93 



0.63 



0.19 
0.20 



0.11 



0.08 



0.18 



0.17 



0.15 



0.13 



0.16 



0.21 



0.24 



0.19 



0.14 



0.09 



0.12 



0.16 



0.25 



0.17 



4.11 
4.11 



4.10 



4.09 



4.00 



3.94 



3.94 



3.93 



3.92 



3.81 



3.81 



3.79 



3.79 



3.79 



3.78 



3.75 



3.75 



3.72 
3.71 



0.25 
0.20 



0.50 



0.25 



0.50 



0.62 



0.35 



0.60 



0.50 



0.30 



0.56 



167505 



160467 



161012 



159889 



163690 



166574 



167738 



167706 



162006 



166010 



167607 



159906 



162150 



169071 



162178 



164642 



167170 



168386 



167223 



0.74 



0.77 



0.44 



0.73 



0.79 



0.82 



0.89 



0.64 
0.64 



0.71 



0.99 



0.81 



0.62 



1.10 



0.72 



0.24 



0.92 



0.88 



0.81 



0.87 



0.20 



0.21 



0.12 



0.20 



0.22 



0.23 



0.25 



0.18 



0.18 



0.20 



0.28 



0.23 



0.18 



0.32 



0.21 



0.07 



0.27 



0.26 



0.24 
0.26 



3.70 



3.67 



3.67 



3.65 

3.i 



t.59 



3.57 
3.56 



3.56 



3.56 
3.55 



3.54 



3.52 



3.44 



3.44 



3.43 



3.43 



3.41 



3.38 



3.38 



3.35 



0.70 



0.30 



0.55 



0.55 



0.50 



0.62 



0.51 



0.31 



0.55 



0.82 



0.30 



0.60 



0.20 



0.40 



0.52 



167906 



160565 



163824 



167591 



168453 



161794 



163726 



160038 



160678 



167987 



164504 



0.83 



0.63 



0.56 



0.79 



0.46 
0.59 



0.95 



1.21 



1.04 



0.94 



1.03 



0.77 



0.25 



0.19 



0.17 



0.24 
0.14 



0.18 



0.29 



0.37 
0.32 



0.29 



0.32 



0.24 



3.32 



3.32 



3.29 



3.29 



3.29 



3.28 



3.28 



3.27 



3.25 



3.24 



3.22 



3.21 



0.70 



0.56 



0.74 
0.90 



0.63 



168642 



169564 



165171 



161754 



165648 



162734 
160303 
167135 



1.25 



0.96 



0.48 



0.16 



1.11 
0.60 



1.01 



1.45 



0.63 



0.39 



0.30 
0.15 



0.05 



0.35 



0.19 



0.32 



0.46 



0.20 



3.21 



3.20 



3.20 



3.20 



3.17 



3.16 



3.16 
3.15 



3.15 



0.60 



0.48 



0.65 



1.30 
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Table 4 

[olecular Signature of Neuroendocrine Tumors 


Unique ID 
No. of Gene 

160098 
169551 
164244 
162220 
164286 
161635 
167713 
163276 
161178 
167553 
163921 

I 167931 

| 160938 
163306 
161650 
162631 
161026 
162571 
160478 

I 160441 


Observe 

0.91 
0.80 
0.67 
0.76 
0.94 
1 nfi 

1 .uo 

0.77 
0.47 
1.05 
0.67 
0.52 
0.99 
0.82 
0.98 
1.23 
1.06 
0.78 
1.11 
1.07 
0.49 


d Expression 

0.29 
0.26 

j 0.22 
0.25 
0.31 
0.35 
0.26 
0.16 
0.36 
0.23 
0.18 
0.35 
0.29 
0.35 

! 0.44 
0.38 
0.28 
0.40 

r 0.39 
0.18 


Ratio 

3.14 
3.08 

! 3.05 
3.04 
3.03 
3.03 
2.96 
2.94 
2.92 
2.91 
2.89 
2.83 
2.83 
2.80 
2.80 
2.79 
2.79 
2.78 
2.74 


Observed 
Expression 

I 0.50 

0.60 
0.80 

0.60 
0.55 
0.50 

0.50 ~J 
0.80 


165786 
165571 
161620 
165813 
160242 
168302 
167351 
168987 

! 161574 
162510 
164106 

I 161454 
160887 
165799 
162419 
166359 

I 169625 
168426 

i 163425 


0.95 
0.84 
0.84 
0.75 
0.83 
0.88 
u.oy 
0.79 
1.05 
0.91 
0.61 
0.99 
1.40 
0.67 
1.03 
0.64 
0.88 
1.09 
0.86 

TC/LC 


0.35 
0.31 
0.31 
0,28 
0.31 
0.33 
0.26 
0.30 
0.40 
0.35 
0.24 
0.39 
0.56 
0.27 
0.44 
0.28 
0.44 
0.55 

0.44 j 


2.72 
2.71 
2.71 
2.71 
2.68 
2.68 
2.67 
2.65 
2.63 
2.63 
2.60 
2.54 
2.54 
2.50 
2.48 
2.34 
2.29 
2.00 
1.98 

1.95 


0.42 J 

0.60 1 

0.80 

0.80 

0.70 

0.40 

0.72 
0.50 
0.60 
1.24 

0.55 ] 
0.80 J 

0.80 


167158 
159877 
166807 
161598 
I 164927 
163031 
160630 


TC 

6.52 
5.89 
4.46 

2.59 

•1 An 
1 .4U 

3.15 
1.35 


T C 

0.87 
1.39 
1.11 
0.82 
0.51 
1.22 


IC/JLC 

7.49 
4.24 
4.02 
3.16 

2.75 I 
2.58 


Normal Cells 


162247 
I 167219 
163475 
163762 
1 166328 


1.40 
1.16 
1.17 
1.04 
2.12 

LC/TC 


0.53 
0.67 
0.57 
0.60 
0.54 
1.14 


2.55 
2.09 
2.04 
1.95 
1.93 
1.86 




f 165400 
164850 
164868 


LC 

1.70 
0.20 
2.39 


TC 

0.82 
0.09 
1.16 


LC/TC 

2.07 
2.22 
2.06 


Normal Cells 



WO 2004/041196 



PCT/US2003/034787 



-43- 







Table 4 






M 


olecular Signature of Neuroendocrine Tiimnrc 


Unique ID 
No. of Gene 

161533 
[ 160957 
169429 
169432 
I 165583 
165617 
168987 
|] 161709 
169625 
165799 


Observei 

1.59 
3.20 
4.52 
2.04 
0.10 
2.90 
0.60 
1.89 
0.98 
0.53 


i Expression 

I 0.48 
1.16 
1.18 
1.04 
0.05 
1.32 
0.30 
0.79 
0.44 


Ratio 

3.31 
2.76 
3.83 
1.96 
2.00 
2.20 
2.00 
2.39 
2.23 


Observed 
Expression 

0.20 


161896 
165813 
162571 
161948 
167116 

!| 167125 
167153 
162734 
163425 

i 164106 
160237 


2.12 
0.59 
1.32 
0.19 
0.18 
3.23 
6.27 
0.73 
0.95 
0.48 
3.50 


0.27 
1.04 
0.28 
0.40 
0.09 
0.08 
1.36 
1 **1 

0.32 
0.44 
0.24 
1.38 


1.96 
2.04 
2.11 
3.30 
2.11 
2.25 
2.38 
4.79 
2.28 
2.16 
2.00 
2.54 


0.55 
0.70 

0.80 I 
0.22 i; 

— 

0.60 
0.80 
0.50 

1 


1 64206 
1 64244 
168266 
1 60429 


0.26 
0.43 
0.20 
2.54 


0.11 
0.22 
0.10 
0.94 


2.36 
1.95 
2.00 
2.70 


— 

0.94 


159898 
160441 
167713 


0.28 
0.37 
0.64 


0.09 
0.18 
0.26 


3.11 
2.06 
2.46 


0.25 
0.42 


1 \J\i\J\jC 


0.18 


0.08 


2.25 


0.50 


159906 
161117 


0.42 
2.52 


0.18 


2.33 


0.30 


161163 


0.18 


1.1^ 

0.08 


2.25 
2.25 


1.12 
0.35 


160565 


0.45 


0.17 


2.65 


0.50 


164504 
165171 


0.51 
0.30 


0.24 
0.05 


2.13 
6.00 


0.80 


16191 1 
I 1 60605 


0.27 
5.94 


0.12 
1.06 


2.25 
5.60 


0.35 
0.78 


160617 
1 67906 


3.57 
0.40 


0.93 
u.iy 


3.84 
2.11 


0.90 
0.80 


167948 
164642 
I 164646 


0.61 
0.69 
0.39 


0.28 
0.27 
0.15 


2.18 
2.56 
2.60 


0.45 
0.42 


164652 


2.63 

SC/LC 


1.18 


2.23 






SC 






JNormal Cells 


161244 


0.38 




3.80 


0.20 


161223 
162391 


0.71 
0.60 


0.20 
0.17 


3.55 

3.53 i 


0.40 
0.35 


166635 
160035 
162207 
165052 


0.63 
0.72 
0.58 
0.60 


0.18 
0.21 
0.17 
0.18 


3.50 
3.43 

3.41 | 


0.40 
0.50 

0.55 i 


161954 
164927 


0.78 
1.65 


0.24 
0.51 


3.33 
3.25 
3.24 


0.50 

0.20 i 


160127 


0.44 


0.14 


3.14 


0.47 


160262 


0.58 


0.19 


3.05 




161643 


1.16 


0.39 


2.97 


0.80 


165483 
166195 


0.98 
0.56 


0.33 


2.97 


0.73 






0.19 


2.95 


0.30 
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Table 4 

Molecular Signature of Neuroendocrine Tumors 



Unique ID 
No. of Gene 



161948 



161163 



167223 



161774 



160715 



164586 



161790 



165583 



168638 



160802 



160102 



165039 



163762 



161374 



163787 



161012 



167931 



160467 



165614 



167591 



165790 



162244 



162631 



161635 



162006 



162247 



169071 



159889 



160323 



161211 
160803 



Observed Expression 



0.56 



0.53 



0.87 



0.57 



0.94 



0.48 



0.45 



0.28 



0.58 



0.44 



1.07 



0.19 



1.44 



0.89 



0.81 



0.73 



0.99 



0.44 



0.82 



0.46 



0.45 



0.74 
1.06 



1.06 



0.71 



1.62 



0.72 



0.79 



0.43 



0.64 
0.71 



0.19 



0.18 



0.30 



0.20 



0.33 



0.17 



0.16 



0.10 



0.21 



0.16 



0.39 



0.07 



0.54 



0.34 



0.31 



0.28 



0.38 



0.17 



0.32 



0.18 



0.18 



0.30 



0.43 



0.43 



0.29 



0.67 



0.30 



0.33 



0.18 



0.27 



Ratio 



2.95 



2.94 



2.90 



2.85 



2.85 



2.82 
2.81 



2.80 



2.76 



2.75 



2.74 



2.71 



2.67 



2.62 



2.61 



2.61 



2.61 



2.59 



2.56 



2.56 
2.50 



2.47 



2.47 



2.47 



2.45 



2.42 



2.40 



2.39 



2.39 



2.37 



Observed 
Expression 



0.22 



0.35 



0.65 



0.45 
0.67 



0.35 



0.20 



0.58 



0.10 



0.55 



0.50 



0.55 



0.30 
0.50 



0.30 



0.70 



0.80 



0.31 



0.55 



0.30 



0.35 



160303 



161794 



168110 



167706 



169691 



168386 



162587 



168266 



164850 



162727 



162220 



1.45 
0.95 



0.80 



0.64 



0.34 



0.81 
0.63 



0.45 



0.45 



0.45 



0.76 



0.30 



0.62 



0.41 



0.35 



0.28 



0.15 



0.36 



0.28 



0.20 



0.20 
0.20 



0.34 



2.37 



2.34 



2.32 



2.29 



2.29 



2.27 



2.25 



2.25 



2.25 



2.25 



2.25 



2.24 



0.55 



1.00 



0.70 



0.38 



0.25 



0.60 



161955 
162623 



160038 



167964 



166010 
167170 



167219 



163682 



162178 



166960 



160367 



160630 



160999 



160275 



161754 



163921 



169254 
164206 



0.38 



0.51 



1.04 



0.33 



0.99 



0.88 



1.25 
0.59 



0.24 



0.37 



1.26 



1.15 



0.91 



0.53 



1.11 



0.52 



0.60 



0.53 



0.17 



0.23 



0.47 



0.15 



0.45 



0.40 



0.57 



0.27 



0.11 



0.17 
0.58 
0.53 
0.42 



0.25 



0.53 



0.25 



0.29 



0.26 



2.24 



2.22 



2.21 



2.20 



2.20 



2.20 



2.19 



2.19 



2.18 



2.18 



2.17 



2.17 



2.17 



2.12 



2.09 



2.08 
2.07 



2.04 



0.36 



0.55 



0.52 



0.20 



0.25 



0.60 



0.60 



0.55 



0.28 
0.40 
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Table 4 I 
[olecular Signature of Neuroendocrine T..mnr c I 


Unique ID 
No. of Gene 

166914 
162343 

! 163824 
167607 
160098 
168079 
161178 
160938 

i 167738 
167505 
159859 


Observe 

0.61 
0.67 
0.79 
0.81 
0.91 
0.93 
1.05 
0.82 
0.64 
0.77 


(J Expression 

0.30 
0.33 

! 0.39 
0.40 
0.45 
0.46 
0.52 
0.41 
0.32 
0.39 
0.73 


Ratio 

2.03 

L 2.03 
2.03 
2.03 
2.02 
2.02 
2.02 
2.00 
2.00 
1.97 
1.97 


Observed 
Expression 

9^2 

0.65 
0.50 

0.56 i 

£60 

0.50 J 
0.51 □ 

0,90 


167553 
162150 
160678 

I 163690 
160486 
160478 

I 165648 
161391 
169564 
167948 
166574 
167135 


0.67 
1.10 
0.94 
0.82 
0.72 
1.07 
0.60 
0.83 
0.48 
1.16 
0.89 
0.63 

LC/SC 


0.34 
0.56 
0.48 
0.42 
0.37 
0.55 
0.31 
0.43 
0.25 
0.61 
0.47 
0.34 


1.97 
1.96 
1.96 
1.95 
1.95 
1.95 
1.94 
1.93 
1.92 
1.90 
1.89 
1.85 


0.50 
0.50 

0.70 


165393 

168700 
i 169384 

165400 

161533 

160957 

169429 

169432 

165576 
I 165617 

161709 
! 165784 

162475 

161896 

167103 

167125 
167153 
167316 
166789 
! 168061 
' 160233 
160237 
! 168141 
168169 
168276 
159813 
160429 
161117 
165171 
164573 
160605 
160617 
169180 

164652 " 


LC 

2.66 
1.91 
2.28 
1.70 
1.59 
3.20 
4.52 
2.04 

1.93 [ 

2.90 

1.89 

1.46 

2.00 

2.12 

1.70 

3.23 

6.27 

1.94 

1.76 

1.32 

2.07 

3.50 

2.51 

2.78 

1.61 

1.99 

2.54 

2.52 

0.30 

2.23 

5.94 

3.57 

1.88 

2.63 


SC 

0.96 

0.82 

0.77 

0.76 

0.67 

0.77 

0.80 

0.65 

0.66 

0.73 

0.95 

0.69 

1.06 

0.75 

0.72 

0.88 

1.00 

0.88 

0.75 

0.64 

0.97 

0.92 

0.95 

1.17 

0.63 

0.83 

0.71 

0.75 

0.16 

0.82 

0.84 

0.86 

0.86 

0.97 


LC/SC 

2.77 

2.33 

2.96 

2.24 

2.37 

4.16 

5.65 

3.14 

2.92 

3.97 

1.99 

2.12 

1.89 

2.83 

2.36 

3.67 

6.27 

2.20 

2.35 

2.06 

2.13 

3.80 

2.64 

2.38 

2.56 

2.40 

3.58 

3.36 

1.88 

2.72 

7.07 

4.15 

2.19 

2.71 


Normal Cells 

1.00 

0.90 

0.78 
0.90 
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Correlation Between Gene Expression Profiles And Genomic 
Imbalance. To compare the results obtained from cDNA array expression in 
accordance with the present invention with previously available information on 
genomic imbalances in neuroendocrine tumors, a search of the literature for 
published data on comparative genomic hybridization (CGH) and loss of 
heterozygosity (LOH) in neuroendocrine tumors was conducted. It was found that, 
among 198 genes identified by the Class Comparison (F-test) analysis, over ninety 
percent of genes with significant changes in LCNEC, and over 80% of genes from 
SCLC and TC, had previously been reported to have chromosomal imbalances by 
gain or loss (CGH) or to be associated with LOH (Table 5). Loss of chromosomal 
materially LOH closely correlated with genes whose expression significantly 
decreased in our analysis. Deletions of several genes, such as cyclin-dependent 
kinase inhibitor (CDKN2A, 9p21) and multiple endocrine neoplasia 1 (MEN1, 
1 lql3) have been studied extensively in pulmonary neuroendocrine tumors 
(Oliveira, A.M. et al (2001) "Familial Pulmonary Carcinoid Tumors," 
Cancer 91:2104-2109; Debelenko, L.V. et al (2000) "MEN1 gene mutation 
analysis of high-grade neuroendocrine lung carcinoma/' Genes Chromosomes 
Cancer. 28:58-65). However, several genes whose expression has been found to be 
decreased herein were previously reported to have a gain of chromosomal material 
by CGH. These include BAK, excision repair cross-complement (ERCC1), DNA 
ligase (LIG1), tubulin beta (TUBB) and others (Table 2). 

Of interest, none of the genes which encode for growth factor/receptors 
identified herein have been reported by LOH. However, loss of genetic material 
by CGH in these genes has been reported. The potential loss of repressor activity 
in the promoter regions of these genes may result in their over-expression as 
detected herein. In sum, the expression profiling of significantly altered genes 
derived from microarray data reported herein closely correlates with chromosomal 
imbalances reported by LOH but not by CGH. 
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Example 3 
Analysis of Gene Expression Profiles 

Analysis of clusters of differentially expressed mRNAs from 9,984 human 
transcripts assigned to each subtype of neuroendocrine tumors identified multiple 
genes (198 genes with a probability of 0.004) exhibiting differential expression. 
This highly selected group of genes contained valuable information which 
correlated with biological behavior of these tumors. The identified genes are 
involved in regulation of apoptosis, cell-cell and cell-matrix interactions, cell 
cycle, DNA synthesis and repair, drug resistance, RNA synthesis and processing, 
receptors and growth factors. Previous studies using microarray analysis of 
lymphomas (Dodson, J.M. etal. (2002) "Quantitative Assessment Of Filter- 
Based Cdna Microarrays: Gene Expression Profiles Of Human T- 
Lymphoma Cell Lines," Bioinformatics 18:953-960; Ramaswamy, S. etal. 
(2001) Multiclass Cancer Diagnosis Using Tumor Gene Expression 
Signatures," Proc Natl Acad Sci USA. 98(26):15149-15154), gastrointestinal 
(Hippo, Y. etal. (2002) "Global Gene Expression Analysis Of Gastric 
Cancer By Oligonucleotide Microarrays," Cancer Res. 62(l):233-240; 

Selaru, F.M. etal. (2002) "ARTIFICIAL NEURAL NETWORKS DISTINGUISH AMONG 

Subtypes Of Neoplastic Colorectal Lesions," Gastroenterology 122:606- 
613), ovarian (Ramaswamy, S. et al. (2001) Multiclass Cancer Diagnosis 
Using Tumor Gene Expression Signatures," Proc Natl Acad Sci USA. 
98(26): 15 149-15 154), and other types of human tumors found that over-expression 
of specific genes is a prominent feature that facilitated the molecular classification 
of these tumors. In contrast, a significant decrease in expression in the majority of 
the selected genes was found. One of the major survival pathways is regulated by 
protection of the mitochondrial membrane by BCL2 which is frequently over- 
expressed in tumor cells (Cleary, MX. et al. (1986) "Cloning And Structural 
Analysis Of Cdnas For Bcl-2 And a Hybrid Bcl-2/Immunoglobulin 
Transcript Resulting From The t( 14; 18) Translocation," Cell. 47(l):l9-28). 
Decreased expression of BCL2 antagonists, BAD and BAK1 was observed in 
samples from TC and LCNEC. This feature may provide survival advantage 
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without the need for over-expression of BCL2 as occurs in certain types of 
lymphomas. BAD and BAK1 are located on chromosomes 1 lql3 and 6p21, 
respectively, which are in the regions of loss of heterozygosity (LOH) in 
neuroendocrine tumors (Hofmann, W.K. (2002) "Relation Between Resistance 

5 OF PHILADELPHIA-CHROMOSOME-POSITIVE ACUTE LYMPHOBLASTIC LEUKAEMIA 

To The Tyrosine Kinase Inhibitor ST1571 And Gene-Expression Profiles: A 
Gene-Expression Study," Lancet 359:481-486). Expression of BAK was further 
suppressed in TC and LCNEC below the level expected for LOH which suggests 
an additional regulatory mechanism. Interestingly, gain of chromosomal material 
10 in 6p21 was reported in LCNEC by CGH (Michelland, S. et al. (1999) 

"Comparison of Chromosomal Imbalances In Neuroendocrine And Non- 
Small-Cell Lung Carcinomas," Cancer Genet Cytogenet 1 14:22-30). 
Suppression of other apoptosis-promoting genes, such as caspase 4 (CASP4), may 
also provide survival advantage and has not been previously reported in 

1 5 Neuroendocrine tumors. Loss of expression of many genes which regulate cell-cell 
and cell-matrix interactions as well as DNA and RNA synthesis and repair were 
apparent in all tumor types (Table 2). Table 2 shows representative deregulated 
genes classified by function. Genes selected by F-test with probability of <0.004 
were genes assigned to functional categories and compared with the published 

20 comparative genomic hybridization (CGH) results (Michelland, S. et al. (1 999) 
"Comparison Of Chromosomal Imbalances In Neuroendocrine And Non- 
Small-Cell Lung Carcinomas," Cancer Genet Cytogenet 1 14:22-30; Lui, W.- 
O. et al. (2001) "High Level Amplification Of 1P32-33 And 2p22-24 In Small 
Cell Lung Carcinomas" Intl. J Oncol. /P:451-457; Ullmann, R., et al. (2001) 

25 "Chromosomal Aberrations In A Series Of Large-Cell Neuroendocrine 
Carcinomas: Unexpected Divergence From Small-Cell Carcinoma Of The 
Lung," Hum Pathol. 32:1059-63; Walch, A.K. etal. (1998) "Typical And 
Atypical Carcinoid Tumors Of The Lung Are Characterized By l 1q 
Deletions As Detected By Comparative Genomic Hybridization" Am J 

30 Pathol. 753:1089-98). 
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In the table, SC denotes small cell; LC denotes large cell neuroendocrine 
carcinoma; and TC denotes typical carcinoid. 

Most studies performed to-date compare tumor samples with cDNA from 
normal tissues of an individual patient, pooled normal tissues or pooled cell lines 
5 as reference. To illustrate the invention, RNA from a single human cell line 
derived from normal bronchial epithelium, BEAS-2B (Amstad, P. etal. (1988) 
"Neoplastic Transformation Of A Human Bronchial Epithelial Cell Line 
By A Recombinant Retrovirus Encoding Viral Harvey Ras," Mol Carcinog. 
1988 1:151-60), was used as a reference RNA. This cell line has minimal 
1 0 chromosomal rearrangements in early passages and neuroendocrine tumor features 
(Lee, B.H et al. (1 998) "In Vitro Chromosome Aberration Assay Using 
Human Bronchial Epithelial Cells," J. Toxicol Environ. Health A. 55:325-9). 
Thus, the data indicate that accurate classification of neuroendocrine tumors can be 
achieved by comparing gene expression profiles of tumors to a single cell line 
. 15 derived from the same cell type. This method is applicable to analysis of tumor- 
derived gene expression profiles from other organs, such as brain, where 
availability of normal tissue is limited. 

In addition to suppression of the apoptotic pathway, only LCNEC tumors 
had increased expression (2-6- fold) of several receptors and growth factors. 

20 Increased expression of PDGFRB in conjunction with suppression of PDGFA- 

associated protein, which can down regulate the activity of PDGFA, could result in 
additional proliferative signal and contribute to the aggressive behavior of this 
tumor. In addition, high expression of an adhesion plaque-associated protein, 
P3 1 1, which has been recently identified as a glioblastoma invasion gene (Mariani, 

25 L. etal. (2001) "Identification And Validation OfP31 1 As A Glioblastoma 
Invasion Gene Using Laser Capture Microdissection," Cancer Res 61 :4l90- 
4196) was detected. 

The lack of a similar pattern of gene expression in SCLC may result from 
the small number of samples examined or may result from different transforming 
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mechanisms since oncogenic mutations (p21 M , p53 and others) but not over- 
expressions are associated with SCLC (Wistuba, LI. etal. (2001) "MOLECULAR 
Genetics Of Small Cell Lung Carcinoma," Semin Oncol 28: 3-13). 
Functional analysis of genes whose expression significantly altered in pulmonary 
5 neuroendocrine tumors provides insight into the underlying biological mechanism, 
leading to survival and slow progression of TC whereas LCNEC and SCLC have 
an aggressive behavior. 

Many studies have identified genes whose expression is significantly 
suppressed in neuroendocrine tumors. High incidence of LOH at 3p, 5q, ilq, and 
1 0 17p (Ohnuki, Y. et al. (l 996) "Chromosomal Changes And Progressive 

Tumorigenesis Of Human Bronchial Epithelial Cell Lines," Cancer Genet. 
Cytogenet 92:99-1 10), except for chromosome 13q, correlates with significant 
decrease in expression of genes assigned to these locations, including MENI 
(1 lql3). The data adds to previously reported studies and confirms that expression 

1 5 profiling of lung neuroendocrine tumors provides accurate tumor classification. 
The molecular signature of relative abundance of gene expression derived by 
comparing mean gene expression of each 3 tumor subtypes is independent of the 
reference RNA and is of particular interest because of its clinical relevance. These 
results indicate that gene expression profiling of pulmonary neuroendocrine tumors 

20 provides a diagnostic tool for tumor classification, particularly when 
histopathology interpretation is ambiguous. 

In summary, light microscopy-based classification of pulmonary 
neuroendocrine tumors is often difficult. To search for molecular markers of 
neuroendocrine tumors, cDNA microarrays of 9,984 human transcripts were used 

25 to identify classification-associated genes at a global genomic scale. Laser-capture 
microdissection was used to harvest tumor cells from frozen sections. The gene 
expression profiles in primary pulmonary neuroendocrine tumors from 17 surgical 
specimens (1 1 Typical Carcinoids,(TC), 3 Small Cell lung cancers (SCLC), 2 
Large Cell Neuroendocrine tumors (LNEC), and one sample which had features of 

30 SCLC and LNEC) were compared. The BRB ArrayTool (National Cancer 
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Institute, NIH; http://linus.nct t nih.gov/BRB-ArravTools.htmn was employed to 
analyze gene expression patterns. An unsupervised, hierarchical clustering 
algorithm used to analyze these 17 tumors based only on similarities in gene 
expression resulted in a precise classification of each tumor type. The Class 
5 Comparison Tool used to compare each tumor type identified 198 statistically 
significant genes (p<0.004) that accurately discriminated between 3 pre-defined 
tumor types. Analysis of these genes revealed that deletions were more frequent 
than were amplifications in pulmonary neuroendocrine tumors. Using comparative 
analysis of gene expression variance, a molecular signature for each tumor type 

10 was identified. The signature genes included decreased expression of pro- 

apoptotic genes, cell-cell and cell matrix interacting components, cell cycle control 
and DNA repair, and anti-oncogenes. In particular, decreased expression of the 
BCL2 antagonist, BAK1, was found in all tumor types, whereas BAD was 
decreased in LCNEC and TC tumors. Over-expression of several growth factors 

15 and receptors (CSF2RB, PDGFRB, ELI 3RA2, and IL6ST (gpI30)) was detected 
only in LCNEC tumors, and increased expression of IL-8Rp was shared by TC 
tumor cells. High expression of a neuronal marker, P31 1, previously reported to 
promote invasive phenotype in brain tumors, was detected in LCNEC, and a 
peptide processing enzyme, Carboxypeptidase E (CPE), was found in TC. The 

20 analysis indicates that functional genomic comparison of expression profiles can 
accurately classify pulmonary neuroendocrine tumors and will therefore facilitate 
the development of new therapies for patients having these malignancies. 

Table 5 lists genes that are differentially expressed in different 
neuroendocrine tumors. 

1 Table 5 

Genes Differentially Expressed In Small Cell Lung Cancer (SCLC) 
Neuroendocrine Tumor Cells Relative To Large Cell Neuroendocrine 
Carcinoma (LCNEC) Neuroendocrine Tumor Cells 

IncytePD:523635 IncytePD: 17341 13 IncytePD:2074154 

IncytePD:56 1992 IncytePD: 1 743234 IncytePD:2 1 041 45 

IncytePD:605019 IncytePD: 1749727 IncytePD:2 172334 

IncytePD:6 14679 IncytePD: 1755793 IncytePD:2 180031 

IncytePD:629077 IncytePD: 1808260 IncytePD:2 182907 

IncytePD:637639 IncytePD: 1810821 IncytePD :2200079 
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Table 5 



IncytePD:696002 
IncytePD:740878 
IncytePD:771715 
IncytePD:820580 
IncytePD:849425 
IncytePD:942207 
IncytePD:958513 
IncytePD:961082 
IncytePD:998069 
IncytePD: 1258790 
IncytePD: 1297269 
IncytePD:1308112 
IncytePD: 1339241 
IncytePD: 1382374 
IncytePD: 1402615 
IncytePD: 1405652 
IncytePD: 143 1819 
IncytePD: 1435374 
IncytePD: 1445203 
IncytePD: 1453450 
IncytePD: 1481225 
IncytePD: 1486983 
IncytePD: 150 1080 
IncytePD: 1555545 
IncytePD:1561352 
IncytePD: 1567995 
IncytePD: 1603584 
IncytePD: 16 10083 
IncytePD: 1624024 
IncytePD: 1625169 
IncytePD: 1635008 
IncytePD: 163 75 17 
IncytePD: 16539 11 
IncytePD: 1685342 
IncytePD:1691161 
IncytePD:1699149 
IncytePD: 1702266 
IncytePD: 1 969563 



IncytePD:1821971 

IncytePD: 1824957 

IncytePD:1841920 

IncytePD:1853163 

IncytePD: 1857493 

IncytePD: 1872067 

IncytePD: 1890919 

IncytePD:1921567 

IncytePD: 193 1265 

IncytePD: 1942845 

IncytePD:1960722 

IncytePD:1968721 

IncytePD: 1988239 

IncytePD: 1990361 

IncytePD:1997937 

IncytePD: 1997967 

IncytePD:2048144 

IncytePD:2050085 

IncytePD:2054529 

IncytePD:2055640 

IncytePD:2055687 

IncytePD:2055773 

IncytePD:2055926 

IncytePD:2056149 

IncytePD:2056172 

lncytePD:2056987 

IncytePD:2057547 

IncytePD:2057823 

IncytePD:2058537 

IncytePD:2060308 

IncytePD:2679117 

IncytePD:2740235 

IncytePD:2751387 

IncytePD:2852403 

IncytePD:2956581 

IncytePD:2956906 

IncytePD:3032691 

IncytePD:3032825 



IncytePD:2205246 

IncytePD :23 08525 

IncytePD:23 56635 

IncytePD:2374294 

IncytePD:2469592 

IncytePD:2506427 

IncytePD :2507648 

IncytePD:2508570 

IncytePD:2568547 

IncytePD:26 10374 

IncytePD:2663948 

IncytePD:2674277 

IncytePD:3038508 

IncytePD:3115514 

IncytePD:3 123858 

IncytePD:3179113 

IncytePD:3202075 

IncytePD:3255437 

IncytePD:3333130 

IncytePD:3360476 

IncytePD:3381870 

IncytePD:3427560 

IncytePD:3432534 

IncytePD:3518380 

IncytePD:3562795 

IncytePD:3842669 

IncytePD:3967780 

IncytePD:3990209 

IncytePD:3999291 

IncytePD:4014715 

IncytePD:40 16254 

IncytePD :4059 193 

IncytePD:4144001 

IncytePD:4287342 

IncytePD:4626895 

IncytePD:5017148 

IncytePD: 5096975 



Genes Differentially Expressed In Small Cell Lung Cancer (SCLC) 
Neuroendocrine Tumor Cells Relative To Typical Carcinoid (TC) 
Neuroendocrine Tumor Cells 



IncytePD:477045 
IncytePD:478960 
IncytePD:523635 
IncytePD:557451 
IncytePD:561992 
IncytePD:588157 
IncytePD:605019 
IncytePD:696002 
IncytePD:740878 
IncytePD:771715 
IncytePD:8 18568 
IncytePD:820580 
IncytePD:885601 



IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD 



1748705 
1749727 
1755793 
1773638 
1807294 
1808260 
1810821 
1812955 
1822716 
1824957 
1841920 
1853163 
1857493 



IncytePD; 
IncytePD 
IncytePD 
IncytePD 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD 
IncytePD: 



:2453436 
2469592 
2506427 
:2508570 
2610374 
2622566 
2663948 
2674277 
2679117 
2722572 
2728840 
2740235 
2748942 
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Table 5 



IncytePD:899102 
IncytePD:958513 
IncytePD:961082 
IncytePD: 1240748 
IncytePD: 1258790 
IncytePD: 1297269 
IncytePD: 13081 12 
IncytePD:1402615 
IncytePD: 1405652 
IncytePD: 143 18 19 
IncytePD: 1435374 
IncytePD: 1445203 
IncytePD: 1453450 
IncytePD: 148 1225 
IncytePD: 1486983 
IncytePD:1488021 
IncytePD: 1505977 
IncytePD: 1513989 
IncytePD: 1559756 
IncytePD.1561867 
IncytePD: 1562658 
IncytePD: 1567995 
IncytePD: 1603584 
IncytePD: 161 0083 
IncytePD: 1624024 
IncytePD:1625169 
IncytePD: 1635008 
IncytePD: 16539 11 
IncytePD: 1669254 
IncytePD: 1672749 
IncytePD:1691161 
IncytePD: 1693 847 
IncytePD: 1699 149 
IncytePD: 1702266 
IncytePD: 1704168 
IncytePD:1712663 
IncytePD:1734113 



IncytePD:! 858365 

IncytePD: 1872067 

IncytePD: 18909 19 

IncytePD: 1920650 

IncytePD: 1921567 

IncytePD: 193 1265 

IncytePD: 1942845 

IncytePD: 1960722 

IncytePD:1968721 

IncytePD: 1988239 

IncytePD: 1997792 

IncytePD:2050085 

IncytePD:2054529 

IncytePD:2055640 

IncytePD:2055687 

IncytePD:2055773 

IncytePD:2055926 

IncytePD:2056149 

IncytePD:2056172 

IncytePD:2056642 

IncytePD:2056987 

IncytePD:2057547 

IncytePD:2057823 

IncytePD:2057908 

IncytePD:2058537 

IncytePD:2060308 

IncytePD:2074154 

IncytePD:2104145 

IncytePD:2153373 

IncytePD:2172334 

IncytePD:2180031 

IncytePD:2182907 

IncytePD:2304121 

IncytePD:2356635 

IncytePD:2369544 

IncytePD:2374294 

IncytePD:2383065 



IncytePD:2751387 

IncytePD:2758740 

IncytePD:2798872 

IncytePD:2806778 

IncytePD:2852403 

IncytePD:2888814 

IncytePD:2914719 

IncytePD:2923082 

IncytePD:2956906 

IncytePD:3010959 

IncytePD:3032691 

IncytePD:3032825 

IncytePD:3038508 

IncytePD:3115514 

IncytePD:3 123858 

IncytePD:3179113 

IncytePD:3202075 

IncytePD:3334367 

IncytePD:3381870 

IncytePD:3432534 

lncytePD:3518380 

IncytePD:3562795 

IncytePD:3728255 

IncytePD:3805046 

IncytePD:3871545 

IncytePD:3954785 

IncytePD:3967780 

IncytePD:3990209 

IncytePD:3999291 

IncytePD:4014715 

IncytePD:4059193 

IncytePD :4 144001 

IncytePD:4253663 

IncytePD:4626895 

IncytePD:5017148 

IncytePD:5096975 



Genes Differentially Expressed In Large Cell Neuroendocrine 
Carcinoma (LCNEC) Neuroendocrine Tumor Cells Relative To 
Typical Carcinoid (TC) Neuroendocrine Tumor Cells 



IncytePD 
IncytePD: 
IncytePD 
IncytePD 
IncytePD 
IncytePD; 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 



629077 
637639 
818568 
885601 
899102 
942207 
1308112 
1402615 
1435374 
1488021 
1501080 
1505977 
1555545 
1559756 



IncytePD 
IncytePD: 
IncytePD: 
IncytePD 
IncytePD 
IncytePD 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD; 
IncytePD: 
IncytePD: 
IncytePD: 



1748705 
1773638 
: 1807294 
:1812955 
:1821971 
:1822716 
: 1858365 
: 1872067 
: 1990361 
1997967 
:2048144 
2153373 
2205246 
2299818 



IncytePD: 
IncytePD: 
IncytePD 
IncytePD 
IncytePD 
IncytePD; 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 
IncytePD: 



:2507648 
:2508570 
:2622566 
:2679117 
:2728840 
:2806778 
:2888814 
:2914719 
:2956581 
3255437 
3333130 
3360476 
3427560 
3518380 
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Table 5 1 

IncytePD:1561352 IncytePD:2304121 IncytePD: 3 805046 ~ 

IncytePD:1561867 IncytePD:23 08525 IncytePD:40 16254 

IncytePD: 16 10993 IncytePD :23 69544 IncytePD:4 144001 

IncytePD: 1704168 IncytePD :2453 436 IncytePD :42 8 73 42 

IncytePD:1712663 IncytePD :2469592 

IncytePD: 1743234 IncytePD :2506427 

The methods employed in the present invention can be similarly employed 
to facilitate the diagnosis of other tumor types, for example, adenocarcinomas, 
which are distinct from neuroendocrine tumors and exhibit significant differences 
in gene expression (Garber, M. E. et al (2001) "Diversity Of Gene Expression 
In Adenocarcinoma Of The Lung" Proa Natl Acad Sci. (USA.) 98: 13784- 
13789; Bhattacharjee, A. etal (2001) "Classification Of Human Lung 
Carcinomas By mRNA Expression Profiling Reveals Distinct 
Adenocarcinoma Subclasses" Proc. Natl. Acad. Sci. (U.S.A.) 98:13790- 
13795). cDNA rnicroarrays that can be used to identify profiles of genes expressed 
in adenocarcinomas are disclosed by Miura, K. et al (2002) ("Laser Capture 
Microdissection And Microarray Expression Analysis Of Lung 
Adenocarcinoma Reveals Tobacco Smoking- And Prognosis-Related 
Molecular Profiles," Cane. Res. 62:3244-3250). 

Example 4 
Analysis of Gene Expression Profiles 

As indicated above, DNA microarray technology (Schena, M. etal (1995) 
"Quantitative Monitoring Of Gene Expression Patterns With A 
Complementary DNA Microarray," Science 270:467 -470; DeRisi, J. et al 
(1 996) "Use Of A cDNA Microarray To Analyse Gene Expression Patterns 
Inhuman Cancer," Nat Genet 14:457-460) provides a powerful tool to analyze 
genome-wide changes in gene expression. Applications of this technology to 
human lung cancers facilitate the identification of gene expression profiles and 
biomarkers associated with adenocarcinoma (Miura, K. et al (2002) "Laser 
capture microdissection and microarray expression analysis of lung 
adenocarcinoma reveals tobacco smoking- and prognosis-related molecular 
profiles," Cancer Res 62:3244-3250; Sugita, M. et al , (2002) "Combined Use Of 
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Oligonucleottde And Tissue Microarrays Identifies Cancer/Testis 
Antigens As Biomarkers In Lung Carcinoma," Cancer Res 62:3971-3979; 
Bhattacharjee, A. et al (200 1) "Classification Of Human Lung Carcinomas 
By mRNA Expression Profiling Reveals Distinct Adenocarcinoma 
5 Subclasses," Proc Natl Acad Sci USA 2001; 98:13790-13795) and NSCLC 
(Heighway, J. et al (2002) "Expression Profiling Of Primary Non-Small 
Cell Lung Cancer For Target Identification," Oncogene 2002; 2 1 :7749- 
7763; Kikuchi, T. et al (2003) "Expression Profiles Of Non-Small Cell Lung 
Cancers On Cdna Microarrays: Identification Of Genes For Prediction 
1 0 Oflymph-Node Metastasis And Sensitivity To Anti-Cancer Drugs," 

Oncogene 22:2192-2205). These studies lead to the identification of molecular 
markers with a potential for better diagnosis, more accurate prediction of 
prognosis, and selection of effective treatment modalities. 

To identify expression profiles and biomarkers for pulmonary NET, laser 

15 capture microdissection (LCM) (Emmert-Buck, MR. et al (1996) "LASER 
Capture Microdissection" Science 1996; 274:9981001; Bonner, R.F. etal 
(1997) "Laser Capture Microdissection: Molecular Analysis Of Tissue," 
Science 278:1481,1483) and cDNA microarrays (Schena, M. etal (1995) 
"Quantitative Monitoring Of Gene Expression Patterns With A 

20 Complementary DNA Microarray," Science 270:467 -470; DeRisi, J. et al 
(1 996) "Use Of A cDNA Microarray To Analyse Gene Expression Patterns 
In Human Cancer," Nat Genet 14:457-460) on 17 cases of primary pulmonary 
NET including TC (n=l 1), LCNEC (n=2), SCLC (n=3) and one case of LCNEC 
combined with SCLC are conducted. The resultant clustering of expression 

25 profiles corresponding to the subtype pulmonary NET are verified by real-time 
RT-PCR analysis and matched completely with the histological classification. Of 
48 classifier genes identified, two are subjected to protein expression analysis by in 
situ immunohistochemistry (IHC) on 55 pulmonary NET cases, which result in the 
identification of carboxypeptidase E (CPE) and y-glutamyl hydrolase (GGH) as 

30 diagnostic biomarkers to differentiate low- and intermediate-grades TC and AC 
from high-grade LCNEC and SCLC. Kaplan-Meier survival analysis reveals that 
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the protein expressions of these two biomarkers can serve as prognosis indicators 
for pulmonary NET patients. 

MATERIALS AND METHODS 

Tissue samples. Fresh frozen tissues of 17 primary pulmonary NET were 
5 collected from hospitals over an 1 1-year period. Tissues were flash-frozen after 
surgery and stored at-80°C until used. Histopathological classification of these 
tumors was based on the 1999 WHO/LASLSC classification of "Histological 
Typing of Lung and Pleural Tumors" (see, Travis, W.D. et al (1998) 
"Reproducibility Of Neuroendocrine Lung Tumor Classification," Hum 
10 Pathol. 29:272-279). The tissues were used for microarray and IHC. A total of 68 
cases (29 TCs, five ACs, nine LCNECs, and 25 SCLCs) were used for IHC and 55 
cases generated informative data. Fifty-four of 55 cases have clinical survival data 
and are used for Kaplan-Meier survival analysis. 

Laser capture microdissection. Frozen tissue (0.5 x 0.5 x 0.5 cm) is 
15 embedded in OCT in a cryomold, and immersed immediately in dry ice-cold 2- 
methylbutane at-50°C. Tissue sections (8 |am) are mounted on Silane-coated 
slides and kept at-80°C until use. The slides are fixed by immersion in 70% 
ethanol, stained with H&E and air-dried for 10 min after xylene treatment. 

The PixCell™ LCM system was used for LCM (Emmert-Buck, M.R. et al 
20 (1996) "Laser Capture Microdissection" Science 1996; 274:9981001; Bonner, 
R.F. et al (1997) "Laser Capture Microdissection: Molecular Analysis Of 
TISSUE," Science 278: 1481, 1483). Tumor cells are fused to transfer film by 
thermal adhesion after laser pulse and transferred into tubes for RNA extraction. 
Total RNA is extracted using Micro RNA isolation kit (Strategene, La Jolla, CA) 
25 according to the manufacturer's instructions. RNA quality is evaluated by 

spectrophotometry and gel electrophoresis. Purified RNA is dissolved into 1 1 ^il 
of DEPC-treated water and used for amplification. The amplified RNA is subjected 
to cDNA microarray analysis (Schena, M. et al (1995) "QUANTITATIVE 
Monitoring Of Gene Expression Patterns With A Complementary DNA 
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Micro array, " Science 270:467 -470; DeRisi, J. et al (1996) "Use Of A CDNA 
MlCROARRAY TO ANALYSE GENE EXPRESSION PATTERNS IN HUMAN CANCER," Nat 
Genet 14:457-460). 

Tissue Culture. A cell line derived from normal bronchial epithelium, 
BEAS-2B, is cultured in a serum-free medium, LHC-9, and harvested at passage 
30. Total RNA is isolated from cultured cells using Micro RNA isolation kit 
(Strategene) according to the manufacturer's instructions. 

RNA amplification. RNA amplification was performed as described by 
Luo, L. et al (1999) ("Gene Expression Profiles Of Laser-Captured 
Adjacent Neuronal Subtypes," Nat Med 1999; 5:1 17-122). Briefly, oligo (dT) 
primers with T7 promoter sequence (SEQ ID NO:l) is used to synthesize the first 
strand of cDNA. After the second strand of cDNA synthesis, RNA is amplified by 
using T7 RNA polymerase on the cDNA templates. Two rounds of amplification 
starting with 1 fig of total RNA generate 40-60 |xg of amplified RNA, which is 
used for microarray analysis. 

Microarray, Hybridization, and Analysis. cDNA microarrays with 9,984 
human genes per slide are provided by the Advanced Technology Center (National 
Cancer Institute, Bethesda, MD). Six of 17 samples are hybridized with two slides 
to work out microarray labeling and hybridization procedures for consensus 
expression data (>95% Pearson Coefficient Correlation between two slides 
hybridized with the same samples). The remaining samples are conducted under 
the same labeling and hybridization conditions. RNA (8 ^g), amplified from the 
BEAS-2B cell line (passage 30), is labeled with Cy5-dUTP as a reference. 
Amplified RNA (4 ^ig each) from tumors is labeled with Cy3-dUTP by using 
Superscript II (Invitrogen, Carlsbad, CA). Briefly, RNA is incubated with Cy3- 
dUTP (or Cy5-dUTP) (Perkin Elmer Life Sciences, Boston, MA) at 42°C for 1 h to 
synthesize the first strand cDNA. The reaction is stopped by the addition of 5 fil 
0.5M EDTA and the RNA is degraded by the addition of 10 ^il IN NaOH and then 
incubation at 65°C for 60 min. After neutralizing, the samples are purified by 
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Microcon 30 (Millipore Corp., Bedford, MA). Each pair of labeled samples is 
hybridized to DNA on slides at 65°C for 1 6 h. After washing, the slides are 
scanned with a GenePix 4000A scanner (Axon Instruments, Inc., Foster City, CA). 
Hierarchical clustering and gene selection are performed by using BRB- 
5 ArrayTools V 3.0 (National Cancer Institute, Bethesda MD, 
http://linus.nci.nih.gov/brb'> . 

Real-time PCR. Total RNA is purified from LCM cells, using the 
Stratagene Absolutely RNATM microprep kit. Samples are treated by DNase I to 
eliminate DNA contamination. Primers are designed, using Primer Express 

1 0 Software V 1 .5 (Applied Biosystmes Inc., Foster City, CA) based on sequences 
from GenBank and purchased from Biosource International (Camarillo, CA). 
Final probe concentration was 200 nM for each gene. Endogenous 18s RNA 
(Applied Biosystems) is used as an internal reference. Reverse transcription is 
completed with the RT-EZ RNA kit (Applied Biosystems) according to the 

1 5 manufacturer's instructions. Samples are run in triplicate and monitored on the ABI 
PRISM 7700. 

Immunohistochemistry. Immunohistochemistry is performed by the 
avidin-biotin peroxidase complex (ABC) method (Vectastain Elite ABC kit, 
Vector, CA). Briefly, slides are deparaffinized, and rehydrated through xylene and 

20 alcohol in Coplin jars. Endogenous peroxidase is blocked with 3% H 2 0 2 in 
phosphate-buffered saline (PBS) for 20 min. All washes are in PBS at room 
temperature if not mentioned. After two washes, Heat Induced Epitope Retrieval 
(HIER) is performed in a citrate buffer (pH: 6.0) in a Biocare Medical chamber 
(Walnut Creek, CA). Slides are rinsed, enclosed with a PAP pen, placed in the 

25 humid chamber, and incubated first with Protein Block (normal GOAT serum 
diluted in PBS containing 1% BSA, 0.09% sodium azide, 0.1% Tween-20 
[BioGenex, CA]), and then with primary antibody: GGH (rabbit polyclonal, Dr. 
Thomas J. Ryan, Wadsworth Center, NY State Dept. of Health, Albany, NY, 
1:1000 diluted by Universal blocking reagent [BioGenex]) and CPE (rabbit 

30 polyclonal, Dr. Lloyd Fricker, Albert Einstein College of Medicine, NY, 1 :500 
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dilution) for 1 h. After three washes, slides are incubated for 30 min with 
biotinylated goat anti rabbit IgG (Vector, 1:250 dilution). After three washes, the 
slides are incubated for 45 min with the ABC reagent (Vector). Slides are washed 
twice, placed in Tris-HCl buffer (pH 7.5) for 5 min, developed with liquid DAB 
(DAKO, CA) for 3 min, washed with H 2 0 twice, and finally counterstained lightly 
with Mayer's hematoxyline for 5 sec, dehydrated, cleared, and mounted with 
resinous mounting medium. Signal intensity and distribution are based on the 
publication (Gillett, C. etal (1994) "Amplification And Overexpression Of 
Cyclin D 1 In Breast Cancer Detected By Immunohistochemical 
Staining," Cancer Res 54: 1812-1817; Beasley, M.B. et al (2003) "The 
P 1 6/Cyclin D 1/Rb Pathway In Neuroendocrine Tumors Of The Lung," Hum 
Pathol. 34:136-142) and scored blindly by three pathologists as follows: 
distribution score (DS) is graded as 0, absent; 1, <10%; 2 10% to 50%; 3, 51% to 
90%; or 4, >90%. Intensity score (IS) is graded as ISO, no signal; IS1, weak; IS2, 
medium; or IS3, strong. The combined total score is determined as total score (TS) 
= distribution (DS) + intensity (IS) (TS0, sum 0; TS1, sum 1 to 3; TS2, sum 4 to 5; 
TS3, sum 6 to 7). TS0 and TS1 are considered negative, whereas TS2 and TS3 are 
considered positive, respectively. 

Statistics. Binomial distributions are used to compute p-values between 
positive and negative immunohistochemical stains of anti-CPE or anti-GGH 
antibodies to tissue sections. Kaplan-Meier survival is calculated in the statistic 
software SPSS 9.0 for Windows. A p-value less than 0.05 or 0.01 is used as 
significant or very significant statistical indicator, respectively. 

RESULTS 

Microarray analysis and expression classification of pulmonary NET. 
Homogeneous cancer cells are collected from pulmonary NET tissue sections by 
LCM avoiding contamination with other cells to conduct microarray analysis of 
gene expression. LCM is performed on 15-18 frozen sections per sample to 
maximize the number of homogeneous cells from each of 17 available fresh frozen 
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pulmonary NET (1 1 TC, two LCNEC, three SCLC, and one combined SCLC and 
LCNEC). High quality total RNA (>1 ^ig/sample) is purified from the dissected 
cells and subjected to two rounds of RNA amplification by T7 RNA polymerase 
(Luo, L. et al (1999) "Gene Expression Profiles Of Laser-Captured 
5 Adjacent Neuronal Subtypes" Nature Medicine 5:11 7-22 1 6) for microarray 
analysis. cDNA microarrays of 9,984 genes are hybridized by Cy3-Iabeled cDNA 
from 4 ^ig tumor RNA and Cy5-labeled reference cDNA from 8 ng RNA of the 
normal bronchial epithelial cell line BEAS-2B(ReddeI, R.R. etal (1988) 
"Transformation Of Human Bronchial Epithelial Cells By Infection 

10 With SV40 Or Adenovirus- 12 SV40 Hybrid Virus, OrTransfection Via 
Strontium Phosphate Coprecipitation With A Plasmid Containing S V 40 
Early Region Genes, 55 Cancer Res 48:1904-1909) for all 17 samples. 
Hierarchical clustering analysis on expression levels of 9,984 genes without prior 
knowledge of sample identity reveals the sample clusters matching histological 

15 classification. An F-test is then conducted by use of the BRB array tool to measure 
variance in gene expression in each sample among three defined subtypes. Based 
on arbitrary criteria of 2-fold changes and p-value <0.004, 198 genes are identified 
(Table 6) that also clustered the 17 tumors into groups in agreement with the 
morphological classification (Figure 4). 



Table 6 

Cluster Genes, Using Average Linkage and EuclideanDistance, and Cutting Tree at 

Three Clusters 


No. 


Unique 
ID 


Gene Symbol 


Map 


Clone 
Incyte PD No. 


UG Cluster 


Cluster # 1 


1 


166807 


GRIA2 


4q32-q33 


1505977 


Hs.89582 


2 


159877 


CPE 


4q32.3 


2153373 


Hs.75360 


3 


161598 


ORC4L 


2q22-q23 


2728840 


Hs.55055 


4 


167158 


C5 


9q32-q34 


1712663 


Hs.1281 


Clust 


er #2 


5 


167153 


GGH 


8q12.1 


1997967 


Hs.78619 


6 


160605 


P311 


5q21.3 


1555545 


Hs. 142827 


7 


169429 


NR3C1 


5q31 


629077 


Hs.75772 


I 8 


165192 


SYNJ2 


6q25-26 


3954785 


Hs.61289 


9 


165784 


ADD3 


10q24.2-q24.3 


1481225 


Hs.324470 


10 


163031 


KIM0751 


8q23.1 


2369544 


Hs. 1536 10 


11 


166328 


PSMC6 


12q15 


1488021 


Hs.79357 
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Table 6 " 

Genes, Using Average Linkage and EucIideanDistance, and Cutting Tree at 
- , Three Clusters 



No. 


Unique 
ID 


Gene Symbol 


Map 


Clone 
Incyte PD No. 




12 


168061 


FTHFD 


OC[£i 1 .O 


2104145 


Hs.9520 


13 


168141 


DGKG 


oC[dL( -q^o 


2568547 


Hs.89462 


14 


165076 


SMG1 


(Op Itt.O 


4253663 


Hs. 11 061 3 


15 


167103 


TAF2 


ftn9A 4 9 


998069 


Hs. 122752 


16 


169391 


EIF2S1 


i*fq^o.o 


1224219 


Hs. 151777 


17 


166789 


2NF202 


i iq^o.o 


1 997937 


Hs.9443 


18 


167316 


SLC24A1 


1 ^n99 


2200079 


Hs. 173092 


19 


168700 


FPRL 1 


i »q i o.o-q io.4 


523635 


Hs.99855 


20 


165576 


IL6ST 


5n1 1 

«JWJ 1 1 


t 2172334 


Hs.82065 


21 


168276 


ITGBL 1 




1258790 


Hs.82582 


22 


169180 


IL8RB 




561992 


Hs.846 


23 


160957 


PRKM2 


1n31 


<:507648 


Hs.2329 


24 


160617 


CSF2RB 


I O. I 


1561352 


Hs.285401 


25 


160429 


PTK6 


cUlj IO.O 


3255437 


Hs.51133 


26 


160237 


NPAT 


1 1 r«99_ri9 J 3 


O O f\ ft t~ r*i r- 

2308525 


Hs.89385 


27 


167125 


TNFRSF6 


i uq^*t. I 


2205246 


Hs.82359 


28 


164652 


PDGFRB 


oqo l-qo^ 


1821971 


Hs.76144 


29 


161117 


ABCG2 


4n99 


1501080 


Hs. 194720 


30 


161896 


COL 15A1 


yq<i i-qz^ 


4287342 


Hs.83164 


31 


159813 


PTPN1? 
i i tin ic 


/q i i .^o 


1382374 


Hs.62 


32 


164573 


DMTF1 




1637517 


Hs.5671 


33 


169384 


SLC22A1 LS 


I I p I 0.0 


3842669 


Hs.300076 


34 


165393 






3202075 


Hs.351699 


35 


168169 


OXCT 


OfJ 1 o 


1685342 


Hs. 177584 


36 


165617 


PRLR 


«J|J | *t-p I o 


3427560 


Hs.1906 


37 


169432 


IL 13RA2 


'nwj i o. i-q<io 


3360476 


Hs.25954 


38 


166812 


MPZL 1 




2057323 


Hs.287832 


39 


168428 


RU NX3 


I poO 


885297 


Hs.170019 


40 


167180 


S100A4 


1n91 


1222317 


Hs.81256 


41 


161533 


CSTF2 


Xn91 ^ 


4016254 


Hs.693 


42 


165588 


SNAPC4 


yqo*Ko 


2224902 


Hs. 11 3265 


43 


164799 


EMP3 


lyq i o.o 


780992 


Hs.9999 


44 


161709 


FLJ 11560 


?p 


1990361 


Hs.301696 


45 


164868 


GBP2 


1 pter-p13.2 


161099^ 


ns. 1 1 \ xyott 


46 


160233 


DYRK3 


1q32 


614679 


Hs.38018 


47 


165400 


MY040 ] 


f q35-q36 


2048144 


Hs. 124854 


48 


165957 


PNLIPRP2 


I0q26.12 


885032 


Hs.143113 


49 


160054 


SEC4L 


I7q25.3 


1824556 


Hs. 302498 


50 


162475 


CTAG2 ) 


<q28 


849425 


Hs.87225 


51 


169182 


LOC5631 1 7 


'q31 


2013272 


Hs.73073 


52 


162912 


DKFZP566B084 I 


>q13 


2680168 


Hs.21201 


53 


163475 


FLJ20485 7 


r q22.1 


2299818 


Hs.98806 


54 


164927 


HNRPAO i 


Iq31 


637639 


Hs. 77492 


55 160630 


HOXD9 2 


Iq31-q37 


2956581 | 


Hs.236646 
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Three Clusters 


No. 


Unique 

in 
LiJ 


Gene Symbol 


Map 


Clone 
Incyte PD No. 


vj ^msier 


56 




JUN 


1 p32-p31 


1969563 


Hs.78465 


57 


1fi^7R9 




1 17 


1743234 


Hs. 120854 


58 


162247 


VLGR1 


5q13 


942207 


Hs. 153692 


59 


1R791Q 

IU / 1 v? 


PUM1 


1p35.2 


3333130 


Hs. 153834 


Clus 


ter #3 






60 


165171 


KRT18 


12q13 


14^5T74 


Hs.65114 


61 




CDC20 


9q13-q21 


94RQ5Q9 


Hs.82906 


62 


1R7Q48 


PIM1 


6p21.2 


2R7Q1 17 

£.\J /Oil/ 


Hs.81170 


63 


1R1Q54 


ATP6F 


1p32.3 


5017148 
\J\J I / i*+o 


Hs.7476 


64 


1R9**Q1 


P0LE3 


9q33 


qfi10A9 


Hs.108112 


65 


1fififi^5 

I UUUOU 


KRT5 


12a12-a13 




Hs. 195850 




iroo^ 


FEN1 


llql2 




Hs.4756 


67 


1 fi 1 774 


SIP2-28 


15a25 3-a26 


4R9ftAQcr 


Hs. 10803 


68 


169907 

1 VJr r \J f 


VAT! 


17q21 


90Rn*5nn 


Hs. 157236 


69 


1R1 1R1 
lull DO 


GUK1 


1 q32-q41 




Hs.3764 


70 


1R199? 


SIVA 


99 




Hs.1 12058 


71 


1R191 1 


CAPG 


2cen-q24 


950857H 


Hs.82422 


72 


161Q48 


CLDN11 


3q26.2-a26 3 


4144001 


Hs.31595 


73 


161^Q1 


IL17F 


6p12 


1R1008^ 


Hs.272295 


74 


162571 


PFKL 


21q22.3 


885R01 


Hs. 155455 


75 


1R4504 


CTSC 


11q14 1-a14 3 


189971R 


Hs. 10029 


76 


1R05R5 


ACY1 


3p21 .1 


1 ft1 once 


Hs.334707 


77 


1RQ551 


GSK3B 


3q13.3 


9057Q08 


Hs.78802 


78 


1RRQ14 


METTL 1 


12q13 


1R0^584 


Hs.42957 


7Q 


1R777P 


CYP27B 1 


12q13 1-a13 3 


1 74Q797 


Hs. 199270 


80 


i ouyoo 


HMGE 


4p16 1 


9074 1 54 


Hs.151903 


81 


1R97^4 


WNT7 A 


3p25 


2622566 


Hs.72290 


89 


IUJO I O 


CASP4 


11 q22.2-q22.3 


2304121 


Hs.74122 


83 


15Q8Q8 


PTTG1 


5q35.1 


1748705 


Hs.252587 


84 


1R1944 


ARF4L 


17q12-q21 


2852403 


Hs.183153 


85 


1R071 5 


CDC34 


19p13.3 


1857493 


Hs.76932 


86 


163787 


PYCR1 


17q24 


1702266 


Hs.79217 


87 


160127 


PGAM1 


10q25.3 


3032691 


Hs.181013 


88 


160323 


ATIC 


2q35 | 


2056149 


Hs q0980 


89 


164850 


IRAKI 


Kq28 


1872067 


Hs. 182018 


90 


165583 


DHCR7 


11q13.2-q13.5 


3518380 


Hs.11806 


91 


165039 


TK1 


17q23.2-q25.3 


2055926 


Hs. 105097 


92 


167964 


CDKN2A S 


3p21 


2740235 


Hs.1174 


93 


167223 


GNB1 


lp36.21-36.33 


3562795 


Hs.215595 


94 


167931 


CSTF1 : 


>0q13.2 


1635008 


Hs. 172865 


95 


163690 


HXB < 


3q33 


1453450 


Hs.289114 


96 


161955 


CNTN2 


I 332.1 


4014715 


Hs.2998 


97 


160275 


SSRP1 ] 


lql2 


2055773 


Hs.79162 


98 


168110 


TAF12 1 


p35.1 


1297269 


Hs.82037 
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Table 6 

Genes, Using Average Linkage and EuclideanDistance, and Cutting Tree at 
. Three Clusters 



No. 


T Ininn t- 

ID 


Gene Symbol 


i Map 


Clone 
Incyte PD No. 


UG Cluster 


99 


160102 


ERP70 


if 

I V 


* 1 824957 


Hs.93659 


100 


167116 


NP 


14q13.1 


^40o4oo 


Hs.75514 


101 


160802 


PHB 


17q21 


lo^oioy 


Hs.75323 


102 


161643 


ARL7 


2q37.2 


71 1 RG.'iA 


Hs.11 1554 


103 


162343 


LIMK2 


22q 12.2 


QCOCi Q 

yoooi o 


Hs. 278027 


104 


162727 


PTK9L 


3p21 .1 




HS.6780 


105 


160262 


DDX28 


16q22.1 




HS. 155049 


106 


165790 


SURF1 


9q33-q34 


1 Q91 Rf\7 
I <3c. I OO/ 


HS.3196 


107 


168638 


HDAC7A 


12q13.1 




HS.2/5438 


108 


168079 


EMP1 


12p12.3 




HS. f 9oo8 


109 


160999 


P114-RHO-GEF 


19p13.3 


1 1 04 1 1 O 


nS.DloO 


110 


161790 


KIM0469 


1 d36 23 




Hs.7764 


111 


169691 


E2-EPF 


17d12-d11 




Hs. 174070 


112 


163682 


DPH2L2 


1p34 


I O I Uo21 


Hs. 324830 


113 


168266 


PSME3 


17a12-a21 


i7nai 19 

lOUO I 1 2 


Hs. 152978 


114 


161374 


POLA2 


11q13.1 


71 7Q*1 *! *a 


Hs.81942 


115 


164646 


GALE 


1 d36-d3*5 


! *iftri7oo/i 
iou/2y4 


Hs.76057 


116 


162150 


APOL 1 


22q13.1 


^uooyo/ 


Hs. 11 4309 


117 


164206 


FN14 


16d13 3 


I4U201 o 


Hs. 10086 


118 


162623 


BAK1 


6p21.3 


rflUOOOO/ 


Hs.93213 


119 


162244 


ARHGDIA 


17q25.3 


ZU00D4U 


If A c c\ A r* 4 

Hs. 159161 


120 


164586 


ITPA 


20p 


10,71 9fi5 


HS.D817 


121 


165483 


PDAP1 


7q22.1 


7H79P9A 


HS. 278426 


122 


166195 


APRT 


16q24 


9751 7ft7 


i i — oonn a 

HS. 28914 


123 


166960 


APG12L 


5a21-a22 


9H5Q577 
gUOOOOf 


HS. 264482 


124 


167505 


TST 


22q13.1 


1Qftft97Q 


ii— n c a oo o 

Hs. 351863 


125 


168642 


ST14 


1 1 a24-a25 


4/ oyou 


HS.00937 


126 


167170 


DXS1283E 


Xp22.3 


i oo/ yyo 


i i— f%r+ a 

Hs.264 


127 


161754 


ACTG2 


2p13.1 


77fl*l Q7r\ 


ii _ Ton i r 

Hs.78045 


128 


166010 


RIPK1 


Bp25.3 


91 pn,n,7i 

£. I OUUol 


HS. 296327 


129 


161794 


SCAMP2 


1 5a23-a25 


7197P5Q 


Hs.238030 


130 


167591 


COMT 


22qll.21 


ououi y 


Hs.240013 


131 


162587 


POLR2D 


2q21 




i i— a c\ a a a 

HS. 194638 


132 


169071 


CAPZB 


1 p36.1 


1853163 


Hs. 3334 17 


133 


160467 


POLD2 


?p13 


2056172 


Hs.74598 


134 


162178 


C2F 


12p13 


5096975 


Hs. 12045 


135 


167706 


GMPPB 


3p21.31 


1486983 


Hs.28077 I 


136 


160803 


FARSL 


19p13.2 


1808260 


Hs.23111 H 


137 


169254 


POLM 


?p13 


771715 


Hs.46964 


138 


167351 


MYBPH 


I q32.1 


3010959 


Hs.927 


139 


163276 




7 


2383065 


Hs.25892 


140 


167135 


ERCC1 


I9q13.2-q13.3 


2054529 


Hs.59544 | 


141 


160478 


G5B I 


5p21.3 


1942845 


Hs.73527 


142 


162631 " 


TADA3L C 


Jp25.2 


3990209 


Hs.158196 
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Muster Gei 


Table 6 

les, Using Average Linkage and EucIideanDistance, and Cutting Tree at 
Three Clusters 


No. 


Unique 
ID 


Gene Symbol 


Mao 


Clone 
Incyte PD No. 


uo Cluster 


A A 1 

14o 


A COftO A 

163921 


oNPI 


5q21 


1653911 


Hs.278500 


AAA 
144 


160098 


MKPL49 


11q13 


1755793 


Hs.75859 


4 AG, 
140 


A CIA nco 

161058 


IVltNl 


11q13 


1693847 


Hs.24297 


A AC* 


160038 


BAD 


11q13.1 


3967780 


Hs.76366 


147 


1 62220 


rrxBPIA 


20p13 


4059193 


Hs.349972 


A A O 

148 


A ry A nor 

161026 


HSXQ280RF 


Xq28 


1669254 


Hs.6487 


149 


167607 


1 KAP1 


16p13.3 


1960722 


Hs. 182366 


150 


167713 


r\IM01 fo 


9p11.2 


3805046 


Hs. 184339 


AHA 

151 


165648 


UUor4 


8p12-p11 


740878 


Hs.2359 




lol 0/4 


CDATO 
Pr\M 1 Z 


1 Oq23-q24. 1 


3871545 


Hs. 140720 


a *;q 


lolboO 


rUIVIU4 1 O 


7p22.2 


I 2798872 


Hs.229950 


1 04 


1 00000 


MPii r*A 


10 


1431819 


Hs.75337 


1 **** 


1 oyyuo 




11 q23.2-q23.3 


1704168 


Hs. 147097 


I OO 


ib/yub 


KAt 1 
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Hs.1770 
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Hs.78853 
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161012 
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1699149 


Hs.75354 
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162006 


REG1B 


2p12 


2374294 


Hs.4158 


AC A 
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161454 


SPINT1 


15q13.3 


2722572 


Hs.233950 
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162510 


CAMKK2 


12 


557451 


Hs. 108708 
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163306 
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15q26.1 


2923082 


Hs. 36820 
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160242 
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Hs.21577 
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164106 


GRWD 


19q13.33 
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Hs.218842 
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165799 


MADH3 


15q21-q22 


1858365 


Hs.211578 
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166574 


SNAPC2 
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Hs.78403 
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175 


161 178 


BSG 


19p13.3 
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Hs.333069 
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TYMSTR 
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2610374 


Hs.34526 


183 


159859 


GS2NA 
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Hs.183105 


184 
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2395018 
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PEX10 
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Table 6 

luster Genes, Using Average Linkage and EuciideanDistance, and Cutting Tree at 

Three Clusters 


No. 


Unique 

ID 


Gene Symbol 


Map 


Clone 
Incyte PD No. 


uu v^iuscer 


186 


<1CCC7<| 

IDOO/ \ 


A MYA1 


4q 13-q22 


1920650 


Hs.1378 


187 




iNrrxblfc 


6p21 .1 


2748942 ^ 


Hs.91640 


188 


165786 


HY AL2 


3p21.3 




Hs.76873 


189 


161620 


H4FE 


6p22-p21.3 


3728255 


Hs.278483 


a r\r\ 


168302 


TIP-1 


17p13 


1997792 


Hs.12956 


191 


160887 


PES1 


22q12.1 


2758740 


Hs.13501 


192 


162419 


RAE1 


£0q13.31 


588157 


Hs. 196209 


193 


169625 


RFC4 


3q27 


1773638 


Hs.35120 


194 


163425 


TCEA2 


20 


818568 


Hs.80598 


195 


166359 


TUBB 


6p21.3 


3334367 


Hs.336780 


196 


161947 


TIM17B 


Xp11.23 


1727491 


Hs.19105 


197 


162236 


KIM0670 


14q11.1 


1968610 


Hs.227133 


198 


168426 


RTVP1 | 


12q15 


477045 


Hs.64639 



Classifier genes for pulmonary NET grades. To identify the classifier genes 
for each tumor subtype independent of the reference cell line, BEAS-2B, two-by- 
two comparisons are conducted on relative expression ratios in the 198 genes 
between three tumor subtypes. Of 198 genes, 178 show at least a 2.5-fold or 
5 higher differential expression between at least one pair of the comparisons 

including TC/LCNEC, TC/SCLC, LCNEC/TC, LCNEC/SCLC, SCLC/TC, and 
SCLC/LCNEC. Using the criteria that the expression of a gene in any one subtype 
is higher than those in the other two, 48 genes are identified including five in TC, 
seven in LCNEC and 36 in SCLC. Each group of the classifier genes can 
10 distinguish one tumor subtype from the other two. Table 7 lists the expression 
ratios of 48 classifier genes along with major function, chromosome location, 
known cytogenetic alteration and UniGene Cluster number. 
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Table 7 

n Ratios of 48 Classifier Genes between TC. LCNEC (LC\ anrt sni r «r\ 


No. 


Gene 
symbol 


Expression 
Ratio 


Function 


Map 


Cyto- 
genetic 
Alteration 


UG cluster 






TC/SC 


TC/LC 










1 


C5 


5.6 


7.5 


Immune 


9q32-q34 




Hs.1281 


2 


CPE 


6.3 


4.2 


Biosynthesis 


4q32.3 


Yes 


Hs.75360 


3 


GR1A2 


5.5 


4.0 


Receptor 


4q32-q33 


Yes j 


Hs.89582 


4 


RIMS2 


3.1 


2.6 


Synaptic exocytosis 


8q23.1 




Hs. 153610 


5 


ORC4L 


2.7 


3.2 


DNA replication 


2q22-q23 


Yes 


Hs.55055 
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KI- 
NO 


Gene 
symbol 


Expr 
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ession 
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Function 


Map 


Cyto- 
genetic 
Altera tlor 


UG cluster 






LC/TC 


LC/SC 










1 


CSF2RB 


I 3.8 


4.2 


Receptor 


22q13.1 


Yes 




2 


GGH 


; 4.8 


6.3 


Drug resistance 


8q12.1 


Yes 


Hs 7861 Q 


3 


NPAT 


2.5 


3.8 


Cell cycle 


11q22-q23 






4 


NR3C1 


3.8 


5.7 


Transcription factor 


5q31 


Yes 


Hs 7577? 

1 1 w » f w g $ ^ 


5 


P311 


5.6 


7.1 


Transformation 


5q22.2 


Yes 


Hs 413760 


6 


PRKAA2 
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4.2 


Metabolism 


1p31 




Hs 2329 


7 


PTK6 


2.7 


3.6 


Oncogene 


20q13.3 


Yes 


Hs 51 133 
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SC/LC 










1 


APRT 


5.1 


2.9 


Metabolism 


16q24 




Hs.28914 


2 


ARF4L 


5.4 


3.8 


Protein secretion 


17q12-q21 




Hs 183153 


3 
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3.7 


2.5 
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17q25.3 




Hs 159161 


4 
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3.0 


Endocytosis 
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Hs 1 1 1 ttd 
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Proton transport 


1p32.3 




Ule 7 47 ft 
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7.5 


3.3 


Cell Cycle, G 1 


1p34.1 


Yes 


Uc 89QflR 
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5.5 
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Cell Cycle, G2 


19p13.3 


| Yes 


Hs 49361*^ 
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6.2 


2.9 


Tight junction 


3q26.2-q26.3 


Yes 


Hs 3159^ 
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3.3 


2.6 


Neurotransmission 


22q11.21 


Yes 


Hs 2400 13 
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2.8 


2.6 


Polyadenylation 


20q13.2 




Hs 1798fiR 


11 


DDX28 


4.8 


3.1 


RNA helicase 


16q22.1 


Yes 


Hs 155049 
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Hs 93659 
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6.5 
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3.3 
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2.9 
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18 
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12q13.1 
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his 6817 
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2.6 


Cell adhesion 


17q21 


Yes 


Hs 2340 


21 


KIAA0469 
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5.7 
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Yes 
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Yes 
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4.9 
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Hs.75323 
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6.5 


3.6 
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No. 

~35~ 
36 


Gene 
symbol 

TYMSTR 
VATI 


Expression 
Ratio 

3.0 1 2.5 
1 4.5 ( 3.4 


Function 

Signal transduction 
Neurotransmission 


Map 

3p21 
17q21 


Cyto- 
genetic 
Alteration 

Yes 


UG cluster 

Hs.34526 
Hs. 157236 



Validation of gene expression changes by real-time quantitative RT- 
PCR. To validate the gene expression profde and the classifier genes, real-time 
RT-PCR analysis are performed on three classifier genes in the 17 pulmonary NET 
using RNA extracted from tumor cells collected by LCM. One gene from each 
tumor subtype is picked based on highly differential expression for the 
confirmation. The expression of CPE, P311 and CDC20 detected by real-time 
quantitative RT-PCR in each of 17 pulmonary NET is first normalized as a ratio to 
the control gene 18S RNA in that tumor and then compared with the expression in 
the reference BEAS-2B cell line. The results show that the expression changes of 
these genes were highly consistent between those detected by the two methods 
(Figures 5A-5F). 

Correlation of CPE and GGH protein expression to pulmonary NET 
grades. To initiate the identification of protein markers for analysis of archived 
pulmonary NET tissue sections, anti-CPE and anti-GGH antibodies are used to 
detect CPE and GGH expression on 68 available pulmonary NET samples 
including 17 used in the microarray analysis, and generated informative data on 55 
cases. The images stained by anti-CPE antibody on the normal lung tissue 
sections, TC, LCNEC and SCLC were studied. No signal is detected in bronchial 
epithelial cells or pneumocytes of normal lung. Some strong staining appears in 
scattered neuroendocrine cells of terminal bronchiolar epithelia and in some 
macrophages. The TC sample displays a positive stain with strong and uniform 
signals on the cell membrane. The LCNEC section have a very weak and scattered 
anti-CPE stain, and the SCLC are completely negative. Only occasional tumor 
cells exhibit a weak intracytoplasmic stain. The images obtained by staining with 
anti-GGH antibody were also studied. Normal lung showed negative staining. TC 
cells also exhibited negative staining. The tumor cells have no detectable signals 
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and mild staining can be seen only in scattered stromal cells. LCNEC cells stained 
positively. All tumor cells show intracytoplasmic stain, with most staining seen in 
the cytoplasm with a course granular staining pattern. SCLC cells show 
intracytoplasmic stain with course granular pattern. 

5 Table 8 summarizes the results of anti-CPE and anti-GGH stains on the 55 

pulmonary NET samples. The statistical analysis is conducted, based on the 
binomial distributions of positives and negatives. Of 21 cases of TC, 16 (76%) 
were positive to anti-CPE stain and five (24%) are negative. The difference is 
statistically significant (p- value <0.05). The anti-GGH stains on 21 cases of TC 

10 revealed seven positive (33%) and 14 (67%) negative, but there is no statistical 
significance (pvalue>0.05). Four of five (80%) AC cases are positive to the anti- 
CPE stain and all of the five (100%) cases are negative to the anti-GGH, but this 
apparent difference is not statistically significant (p-value >0.05) in light of the 
small sample size (n=5). Although the negative stains of anti-CPE are dominant 

15 events for LCNEC (seven negative versus one positive, 88%) the difference has no 
statistical significance (p-value >0.05), probably due to the small sample size. All 
eight cases of LCNEC are positive to anti-GGH stains (p-value <0.01). Of 21 
cases of SCLC, only four (19%) are positive to anti-CPE stain (p-value O.01). In 
contrast, 16 (76%) are positive to anti-GGH stain (p-value O.05). Therefore, 

20 positive CPE stain is associated with low- and intermediate-grade TC and AC 
while positive GGH stain is associated with high-grade LCNEC and SCLC. 



Table 8 

Immunochemistry on 55 Pulmonary NE Tumors 


Pulmonary 
NE Tumor 


Anti-CPE IHC 


Anti-GGH IHC 


Positive 


Negative 


p-value 


Positive 


Negative 


p-value 


TC 


16 


5 


0.017 


7 


14 


0.189 


AC 


4 


1 


0.625 


0 


5 


0.063 


LCNEC 


1 


7 


0.070 


8 


0 


0.008 


SCLC 


4 


17 


0.007 


16 


5 


0.017 


Total 


23 


32 




31 


24 





CPE and GGH protein expressions predict survival rates of the 
pulmonary NET patients. After the correlation of CPE and GGH expressions to 
pulmonary NET grades, a Kaplan-Meier survival analysis' is conducted on 54 cases 



-69- 



of the pulmonary NET patients with clinical survival data as the function of CPE 
or GGH stains. The 9-year survival probability for the patients with a positive 
CPE is 76%, significantly (p-value <0.05) higher than that with a negative CPE, 
27% (Figure 6A). In contrast, the 9-year survival probabilities for the patients with 
positive and negative GGH staining are 28% and 83%, respectively (Figure 6B). 
The difference is statistically very significant (p-value <0.01). Thus, positive CPE 
and negative GGH are the good prognostic indicators for pulmonary NET patients. 

In the above-described study, the expression of 9,984 genes in pulmonary 
NET are examined and the expression profile, 49 classifier genes and two 
biomarkers are identified. Homogenous cancer cells are collected by LCM from 
1 1 cases of TC, three cases of SCLC, two cases of LCNEC and one case of 
combined SCLC and LCNEC. High quality RNA is extracted from the 
homogeneous cancer cells and subjected to T7 polymerase-based RNA 
amplification. cDNA microarray and unsupervised expression cluster analyses of 
9,984 genes or 198 significantly (p<0.004) differentially expressed genes classified 
17 cases of pulmonary NET into three groups that matched their histological 
classifications completely. In addition, 48 classifier genes are identified by 2-by-2 
expression comparisons of 198 genes between three subtype tumors. The 
expression changes of representative genes are confirmed by real-time quantitative 
RT-PCR. Finally, based on expression profile and by IHC, it is found that positive 
CPE and negative GGH are more frequent events in low-grade TC and 
intermediate-grade AC than in high-grade LCNEC and SCLC and are good 
prognostic indicators for the pulmonary NET patients. 

Expression clustering was developed to analyze gene expression data from 
DNA microarrays(Eisen, M.B. et al. (1998) "Cluster Analysis And Display Of 
Genome- Wide Expression Patterns," Proc Natl Acad Sci USA 95:14863- 
14868). The analysis is based on statistical algorithms to arrange genes and tumors 
according to similarities in gene expression. The dendrogram is the most common 
output to reveal a subclass of genes and cells. In the above study, the expression 
pattern of 9,984 genes or selected 198 genes accurately distinguishes each subtype 
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of 17 pulmonary NET classified by histologic characteristics. It is considered that 
precise LCM of the cancer cells and non-biased RNA amplification contributes to 
the accurate expression classification. 

Luo et al.(l 999) (Gene Expression Profiles Oflaser-Captured 
AdjacentNeuronal Subtypes," Nat Med 5:1 17-122) reported T7 polymerase- 
based RNA amplification (Van Gelder, R.N. etal. (1990) ("Amplified RNA 
Synthesized From Limited Quantities Of Heterogeneous cdna," Proc Natl 
Acad Sci USA 87:1663-1667) to amplify RNA isolated from LCM cells for DNA 
microarray study. In that case, total RNA was extracted from 1,000 neuron cells 
dissected by LCM and subjected to three rounds of amplification before microarray 
analysis, of which, the correlation of signal intensities between the same samples 
varied from 93% to 97% (Luo, L. et al. (1 999) "Gene Expression Profiles Of 
Laser-Captured Adjacent Neuronal Subtypes," Nat Med 1 999; 5 : 1 1 7- 1 22). 
In the above-described study, total RNA is extracted from >10,000 cancer cells 
15 dissected by LCM from at least 15 sections and subjected to only two rounds of 
amplification. These modifications contribute to accurate clusters. 

A reference sample is used as a control to normalize gene expression in test 
samples in cDNA microarrays. To obtain enough common RNA as a reference for 
all test samples is frequently difficult, particularly for a large number of primary 
20 tumors. To date, pooled normal samples or samples pooled from a portion of each 
test sample have been used as a reference. In this and other studies (Miura, K. et 
al. (2002) "Laser Capture Microdissection And Microarray Expression 
Analysis Of Lung Adenocarcinoma Reveals Tobacco Smoking- And 
Prognosis-Related Molecular Profiles," Cancer Res 2002; 62:3244-3250), 
25 the RNA employed is isolated from the immortalized bronchial epithelial cell line, 
BEAS-2B (Reddel, R.R. etal. (1998) "Transformation Of Human Bronchial' 
Epithelial Cells By Infection With SV40 Or Adenovirus- 12 SV40 Hybrid 
Virus, Or Transfection Via Strontium Phosphate Coprecipitation With A 
Plasmid Containing SV40 Early Region Genes," Cancer Res 48: 1904-1909), 
30 as the reference for all test samples. Because the results demonstrated accurate ' 
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classification, RNA from the cell line can be used as the reference for primary 
tumors. Thus, this method may be applicable to microarray analysis of gene 
expression of any cells where a reference sample is not easily obtained. 

Using the Class Comparison analysis (or Gene Selection) of the BRB array 
tool, 198 genes are selected out of 9,984 genes (1.98%) for expression 
classification of 17 pulmonary NET. The clusters based on the 198 genes coincide 
well with those based on 9,984 genes. Two-by-two comparisons of 198 gene 
expression between the three subtypes of pulmonary NET result in the 
identification of 48 classifier genes of which the expression changes are able to 
distinguish the subtypes. The classifier genes are involved in complex regulations 
of apoptosis, cell-cell and cellmatrix interactions, cell cycle, DNA synthesis and 
repair, drug resistance, RNA synthesis and processing, and cell survival. The 
classifier genes provide candidates for understanding and studying pulmonary NET 
biology and the identification of more biomarkers. 

The present invention thus provides the first report that correlates CPE and 
GGH expression patterns to pulmonary NET grades and prognosis. The IHC 
reveal patterns of CPE and GGH expression in pulmonary NET cells. Specifically, 
the frequency of positive staining by anti-CPE in TC (76%) is 4-fold higher than 
that in SCLC (19%). Although the trends of high and low frequencies of positive 
CPE seem apparent in AC and LCNEC, respectively, the statistical significance 
was not reached, perhaps due to the small sample sizes. In contrast, both LCNEC 
and SCLC cells displayed highly significant frequencies of positive anti-GGH stain 
than TC and AC cells. Significantly, the survival analysis correlates positive CPE 
and negative GGH on pulmonary NET cells to very good prognosis. 

CPE is involved in the removal of C-terminal basic amino acids in brain 
and various neuroendocrine tissues. There are two types of CPE, a 50 kDa 
membrane-bound enzyme and a smaller soluble enzyme (Manser, E. etal (1990) 
"Human Carboxypeptidase E. Isolation And Characterization Of The 
cDNA, Sequence Conservation, Expression And Processing in vitro. 
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Biochem J 267:517-525). The former is an amphipathic and secreted enzyme 
(Manser, E. etal. (1991) "Processing And Secretion Of Human 
Carboxypeptidase E By C6 Glioma Cells," Biochem J 280 (Pt 3):695-701). 
Human CPE is located on chromosome 4p33 and no mutations are reported in lung 
cancers. The mutations at Ser202 of mouse CPE affected its expression, enzyme 
activity and intracellular localization (Varlamov, 0. et al. (1996) "Induced And 
Spontaneous Mutations At Ser202 Of Carboxypeptidase E. Effect On 
Enzyme Expression, Activity, And Intracellular Routing," J Biol Chem 
271:13981-13986. A mouse with Cpe/Cpe mutation results in reduced CPE 
enzyme activity and obesity (Naggert, J.K. etal. (1995) "Hyperproinsulinaemia 
In Obese Fat/Fat Mice Associated With A Carboxypeptidase E Mutation 
Which Reduces Enzyme Activity," Nat Genet 10:135-142), and as yet tumors 
have not been reported. The present invention shows that CPE expression is not 
detected in normal bronchial epithelial cells or pneumocytes; however, it is 
elevated in the tumor cells, suggesting that secreted CPE may be a surrogate serum 
marker for non-invasive diagnosis and early detection of pulmonary carcinoid 
tumors. 

The ggh gene may be regulated at both transcriptional and 
posttranscriptional levels. In LCNEC cells, ggh mRNA is increased according to 
the microarrays, which is consistent with the increase in GGH protein based on 
IHC, indicating transcriptional activation. Although anti-GGH antibody detected 
the upregulation in three of four SCLC cases, mRNA elevation is not detected by 
the microarrays, suggesting an alternative posttranscriptional mechanism. The 
study of mechanism(s) of ggh transcription and translation is of importance, not 
only because it has diagnostic and prognostic value, but also because the GGH 
protein (as lysosomal enzyme that catalyzes the hydrolysis of folylpoly-y- 
glutamates and antifolylpoly-y-glutamates by the removal of y-linked 
polyglutamates and glutamate (Wang, Y. etal. (1993) "The Properties Of The 
Secreted Gamma-Glutamyl Hydrolases From H35 Hepatoma 
CELLS,"Biochim Biophys Acta 1 164:227-235)) are known to be implicated in 
methotrexate resistance in sarcoma (Waltham, M.C. etal. (1997) "gamma- 
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Glutamyl Hydrolase From Human Sarcoma HT - 1 080 Cells: 
Characterization And Inhibition By Glutamine Antagonists," Mol 
Pharmacol 51:825-832; Li, W.W. (1993) "INCREASED ACTIVITY Of Gamma- 
GLUTAMYL HYDROLASE IN HUMAN SARCOMA CELLLINES: A NOVEL MECHANISM 

Of Intrinsic Resistance To Methotrexate (MTX)," Adv Exp Med Biol 
338:635-638) and leukemia (Longo, G.S. etal. (1997) "gamma Glutamyl 
Hydrolase And Folylpolyglutamate Synthetase Activities Predict 
Polyglutamylation Of Methotrexate In Acute Leukemias," Oncol Res 
9:259-263; Rots, M.G. etal (1999) "Role Of Folylpolyglutamate 
Synthetase And Folylpolyglutamate Hydrolase In Methotrexate 
Accumulation And Polyglutamylation In Childhood Leukemia," Blood 
93:1677-1683). 

In sum, pulmonary neuroendocrine tumors are found to vary dramatically 
in their malignant behavior and classification based on histological examination is 
often challenging. In searching for molecular markers for these tumors, a cDNA 
microarray expression analysis is conducted. The analysis involved 9,984 genes in 
tumor cells isolated by laser-capture microdissection from primary tumors of 
typical carcinoids (TC), small cell lung cancers (SCLC), large cell neuroendocrine 
carcinomas (LCNEC), and a combined small cell and large cell neuroendocrine 
carcinoma. An unsupervised, hierarchical clustering algorithm resulted in a 
precise classification of each tumor subtype, according to the newly proposed, 
modified histological classification. Selection of genes with significant variance 
resulted in the identification of 198 statistically significant genes (p<0.004) that 
accurately discriminated between three predefined tumor subtypes. Of 198 genes, 
48 classifier genes are identified. Changes in expression of three representative, 
differentially expressed genes were internally validated by real-time RT-PCR. In 
addition, expression of two classifier gene products, carboxypeptidase E (CPE) and 
y-glutamyl hydrolase (GGH), are validated by immunohistochemistry. Kaplan- 
Meier survival analysis reveals that CPE immunostaining is a statistically 
significant predictor of good prognosis, whereas GGH expression correlated with 
poor prognosis. Thus, this molecular profiling accurately classifies pulmonary 
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neuroendocrine tumors and permits the identification of 48 classifier genes and two 
novel prognostic markers. 

All publications and patents mentioned in this specification are herein 
incorporated by reference to the same extent as if each individual publication or 
5 patent application was specifically and individually indicated to be incorporated by 
reference. 

While the invention has been described in connection with specific 
embodiments thereof, it will be understood that it is capable of further 
modifications and this application is intended to cover any variations, uses, or 
10 adaptations of the invention following, in general, the principles of the invention 
and including such departures from the present disclosure as come within known 
or customary practice within the art to which the invention pertains and as may be 
applied to the essential features hereinbefore set forth. 



